(12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(19) World Intellectual Property Organization 

International Bureau 

(43) International Publication Date 
7 December 2000 (07.12.2000) 




PCT 



(10) International Publication Number 

WO 00/73469 A2 



(51) International Patent Classification 7 : C12N 15/54, 

9/12, 15/11, 5/12, C07K 16/40, A61K 38/00, G01N 33/68 

(21) International Application Number: PCT/USOO/14842 

(22) International Filing Date: 26 May 2000 (26.05.2000) 

(25) Filing Language: English 

(26) Publication Language: English 



(30) Priority Data: 

60/136,503 



28 May 1999(28.05.1999) US 



(71) Applicant (for all designated Slates except US): SUGEN, 
INC. PUS/US]; 230 East Grand Avenue, South San Fran- 
cisco, CA 94080 (US). 

(72) Inventors; and 

(75) Inventors/Applicants (for US only): PLOWMAN, Gre- 
gory, D. [US/US]; 4 Honeysuckle Lane, San Carlos, CA 
94070 (US). MARTINEZ, Ricardo [US/US]; 984 Carrier 
Lane, Foster City, CA 94404 (US). WHYTE, David 
[US/US]; 2623 Barclay Way, Belmont, CA 94002 (US). 
SUDERSANAM, Sucha {US/US]; 20 Corte Patencio. 
Greenbrae, CA 94904 (US). 



(74) Agents: WARBURG, Richard, J. et al.; Brobeck, Phleger 
& Harrison LLP, 12390 El Camino Real, San Diego, CA 
92130 (US). 

(81) Designated States (national): AE, AG, AL, AM, AT, AU, 
AZ, BA, BB, BG. BR, BY, CA, CH, CN, CR, CU, CZ, DE, 
DK, DM, EE, ES, Fl, GB, GD, GE, GH, GM, HR, HU, ID, 
IL, IN, IS, JP, KE, KG, KP, KR, KZ, LC, LK, LR, LS, LT, 
LU, LV, MA. MD, MG, MK, MN, MW, MX, NO, NZ, PL, 
PT, RO, RU, SD, SE, SG, SL SK, SL, TJ, TM, TR, TT, TZ, 
UA, UG, US, UZ, VN, YU, ZA, ZW. 

(84) Designated States (regional): ARIPO patent (GH, GM, 
KE, LS, MW, MZ, SD, SL, SZ, TZ, UG, ZW), Eurasian 
patent (AM, AZ, BY, KG, KZ, MD, RU, TJ, TM), European 
patent (AT BE, CH, CY, DE, DK, ES, FI, FR, GB, GR, IE, 
IT. LU, MC, NL, PT, SE), OAP1 patent (BF, BJ, CF, CG, 
CI, CM, GA, GN, GW, ML, MR, NE, SN, TD, TG). 

Published: 

— Without international search report and to be republished 
upon receipt of that report. 

For two- letter codes and other abbreviations, refer to the "Guid- 
ance Notes on Codes and Abbreviations" appearing at the begin- 
ning of each regular issue of the PCT Gazette. 



< 

as 
tt 

© ■ 

O (54) Title: PROTEIN KINASES 

o 

(57) Abstract: The present invention relates to novel kinase polypeptides, nucleotide sequences encoding the novel kinase polypep- 
^ tides, as well as various products and methods useful for the diagnosis and treatment of various kinase-related diseases and conditions. 



WO 00/73469 



PCT/US00/14842 



1 

DESCRIPTION 
PROTEIN KINASES 

FIELD OF THE INVENTION 
5 The present invention relates to novel kinase polypeptides, nucleotide sequences 

encoding the novel kinase polypeptides, as well as various products and methods useful 
for the diagnosis and treatment of various kinase-related diseases and conditions. 

BACKGROUND OF THE INVENTION 
10 The following description of the background of the invention is provided to aid in 

understanding the invention, but is not admitted to be or to describe prior art to the 
invention. 

Cellular signal transduction is a fundamental mechanism whereby external stimuli 
that regulate diverse cellular processes are relayed to the interior of cells. One of the key 

15 biochemical mechanisms of signal transduction involves the reversible phosphorylation of 

proteins, which enables regulation of the activity of mature proteins by altering their 
structure and function. 

Protein phosphorylation plays a pivotal role in biological signal transduction. 
Among the biological functions controlled by protein phosphorylation are the following: 

20 cell division; differentiation and death (apoptosis); cell motility and cytoskeletal structure; 

control of DNA replication, transcription, splicing and translation; protein translocation 
events from the endoplasmic reticulum and Golgi apparatus to the membrane and 
extracellular space; protein nuclear import and export; regulation of metabolic reactions, 
etc. Abnormal protein phosphorylation is widely recognized to be causally linked to the 

25 etiology of many diseases including cancer as well as immunologic, neuronal and 

metabolic disorders. 

The most common phospho-acceptor amino acid residues are serine, threonine and 
tyrosine. Phosphorylation in histidine has also been observed in bacteria. The presence of 
a phosphate moeity modulates protein function in multiple ways. A common mechanism 
30 includes changes in the catalytic properties (V max and K m ) of an enzyme leading to its 
activation or inactivation. A second widely recognized mechanism involves promoting 
protein-protein interactions. An example of this is the tyrosine autophosphorylation of the 
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ligand-activated EGF receptor tyrosine kinase. This event triggers the high-affinity 
binding to the phosphotyrosine residue on the receptor's C-terminal intracellular domain 
to the SH2 motif of the adaptor molecule Grb2. Grb2 in turn binds through its SH3 motif 
to a second adaptor molecule, such as SHC. The formation of this ternary complex 
acivates the signaling events that are responsible for the biological effects of EGF. Serine 
and threonine phosphorylation events have also being recently recognized to exert their 
biological function through protein-protein interaction events mediated by the high- 
affinity binding of phosphoserine and phosphothreonine to WW motifs present in a large 
variety of proteins (Lu, PJ. et al (1999) Science 283:1325-1328). A third important 
outcome of protein phosphorylation is changes in the subcellular localization of the 
substrate. As an example, nuclear import and export events in a large diversity of proteins 
are regulated by protein phosphorylation (Drier E.A. et al (1 999) Genes Dev 13: 556- 
568). 

Protein kinases are one of the largest families of eukaryotic proteins with several 
hundred known members. These proteins share a 250-300 amino acid domain that can be 
subdivided into 12 distinct subdomains that comprise the common catalytic core structure. 
These conserved protein motifs have recently been exploited using PCR-based and 
bioinformatic strategies leading to a significant expansion of the known kinases. Multiple 
alignment of the sequences in the catalytic domain of protein kinases and subsequent 
parsimony analysis permits their segregation into a dendrogram reflecting the relatedness 
of their catalytic domains (Fig. 1). In this manner, related kinases are clustered into 
distinct branches or subfamilies including: tyrosine kinases, cyclic-nucleotide-dependent 
kinases, calcium/calmodulin kinases, cyclin-dependent kinases and MAP-kinases, serine- 
threonine kinase receptors, and several other less defined subfamilies. 

We have recently completed a systematic analysis of the protein kinases present in 
C elegans, the multicellular organism whose entire DNA sequence has been determined. 
We identified 473 unique kinase profiles including 398 full-length conventional kinases, 
and 20 additional proteins that may function as atypical protein kinases. (Plowman G.D. 
etal (1999), Proc. Natl. Acad. Sci. 96:13603-13610). 

Using parsimony analysis, the protein kinases may be divided into 4 major groups: 
AGC, CAMK, CMGC and tyrosine kinases. In addition, there are a number of minor yet 
distinct families, including the STE and casein kinase 1, families related to worm- or 
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fungal-specific kinases, and a family designated "other" to represent several smaller 
families. In addition, we designate an "atypical" family to represent protein kinases whose 
catalytic domain has little or no primary sequence homology to conventional kinases, 
including the A6 kinases and PI3 kinases. 

The AGC kinases are basic amino acid-directed enzymes that phosphorylate 
residues found proximal to Arg and Lys. Examples of this group are the cyclic nucleotide- 
dependent kinases, G protein kinases, NDR or DBF2 and the ribosomal S6 kinases. 

The CAMK group kinases are also basic amino acid-directed kinases. They include 
the Ca2+/calmodulin-regulated and AMP-dependent protein kinases, myosin light chain 
kinases, checkpoint 2 kinases (CHK2) and EMK-related protein kinases. The EMK family 
of STK are involved in the control of cell polarity, micotubule stability and cancer. One 
member of the EMK family, C-TAK1 has been reported to control entry into mitosis by 
activating Cdc25C which in turn dephosphorylates Cdc2. 

CMGC group kinases are "proline-directed" enzymes phosphorylatmg residues 
that exist in a proline-rich context. They include the cyclin-dependent kinases (CDKs), 
mitogen-activated kinases (MAPKs), GSK3s and CLKs. Most CMGC kinases have 
larger-than-average kinase domains owing to the presence of insertions within subdomains 
X and XI. 

The tyrosine kinase group encompass both cytoplasmic (i.e. src) as well as 
transmembrane receptor tyrosine kinases (i.e. EGF receptor). These kinases play a pivotal 
role in the signal transduction processes that mediate cell proliferation, differentiation and 
apoptotis. 

Group members that define smaller, yet distinct phylogenetic branches of 
conventional kinases include the elongation factor 2 kinases (EIFKs); homologues of the 
yeast sterile family kinases (STE) which refers to 3 classes of kinases which lie 
sequentially upstream of the MAPKs; mixed lineage kinases (MLKs); Lim-domain 
containing kinases (LIMKs); Calcium-calmodulin kinase kinases (CAMKK), dual-specific 
tyrosine kinases (DYRK), integrin receptor associated kinase (IRAK); testis-specific 
kinases (TSK); UNC-51 related kinases (UNC); several families that are close 
homologues to worm (C26C2.1, YQ09, ZC581.9, YFL033c, C24A1 .3), Drosophila 
(SLOB), or yeast (YDOD_sp, YGR262_sc) kinases, and others that are "unique" and 
don't cluster into any obvious family. 
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SUMMARY OF THE INVENTION 
Through a search of the EST database for homologies to the conserved catalytic 
kinase domain of protein kinases, hundreds of mammalian members of known and 
previously unidentified protein kinase families and groups have been identified as part of 

5 the present invention. Multiple alignment and parsimony analysis of the catalytic domain 
reveals that approximately half of these protein kinases cluster into 10 known groups, with 
the other half perhaps defining novel groups. Classification in this manner has proven 
highly accurate not only in predicting motifs present in the remaining non-catalytic portion 
of each protein, but also in their regulation, substrates, and signaling pathways. The 

1 0 present invention includes the partial or complete sequence of new protein kinases, their 
classification, predicted or deduced protein structure, and a strategy for elucidating their 
biologic and therapeutic relevance. 

Thus, a first aspect of the invention features an isolated, enriched, or purified 
nucleic acid molecule encoding a kinase polypeptide selected from the gTOup consisting 

15 SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, 
SEQ ID NO:127, SEQ ID NO:128, SEQ 1DN0:129, SEQ ID NO:130, SEQ ID N0.131, 
SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, 
SEQ ID NO: 137, SEQ ID NO: 138, SEQ ID NO: 139, SEQ ID NO: 140, SEQ ID NO: 141, 
SEQ ID NO: 142, SEQ ID NO: 143, SEQ ID NO: 144, SEQ ID NO: 145, SEQ ID NO: 146, 

20 SEQ ID NO: 147, SEQ ID NO: 148, SEQ ID NO: 149, SEQ ID NO: 150, SEQ ID NO:151, 

SEQ ID NO: 152, SEQ ID NO: 153, SEQ ID NO: 1 54, SEQ ID NO: 155, SEQ ID NO: 156, 
SEQ ID NO: 1 57, SEQ ID NO: 1 58, SEQ ID NO: 1 59, SEQ ID NO: 1 60, SEQ ID NO: 1 61 , 
SEQ ID NO: 162, SEQ ID NO: 163, SEQ ID NO: 164, SEQ ID NO: 165. SEQ ID NO: 166, 
SEQ ID NO: 1 67, SEQ ID NO: 1 68, SEQ ID NO: 1 69, SEQ ID NO: 1 70, SEQ ID NO: 1 7 1 , 

25 SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176, 

SEQ ID NO: 1 77, SEQ ID NO: 1 78, SEQ ID NO: 1 79, SEQ ID NO: 1 80, SEQ ID NO: 1 8 1 , 
SEQ ID NO: 1 82, SEQ ID NO: 1 83, SEQ ID NO: 1 84, SEQ ID NO: 1 85, SEQ ID NO: 1 86, 
SEQ ID NO: 1 87, SEQ ID NO: 1 88, SEQ ID NO: 1 89, SEQ ID NO: 1 90, SEQ ID NO: 1 9 1 , 
SEQ ID NO: 199, SEQ ID NOT 93, SEQ ID NO: 194, SEQ ID NO: 195, SEQ ID NO: 196, 

30 SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201, 
SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, 
SEQ ID NO:207, SEQ ED NO:208, SEQ ID NO:209, SEQ ID NO:21 0, SEQ ID NO:21 1 , 
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SEQ ID NO:212, SEQ ED NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, 
SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, 
SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, 
SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:23 1 , 
5 SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, 

SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241 , 
and SEQ ID NO:242. 

By "isolated" in reference to nucleic acid is meant a polymer of nucleotides 
conjugated. to each other,, including DNA and ..RNA, that is isolated from a natural source 

10 or that is synthesized. The isolated nucleic acid of the present invention is unique in the 

sense that it is not found in a pure or separated state in nature. Use of the term "isolated" 
indicates that a naturally occurring sequence has been removed from its normal cellular 
(i.e., chromosomal) environment. Thus, the sequence may be in a cell-free solution or 
placed in a different cellular environment. The term does not imply that the sequence is 

15 the only nucleotide chain present, but that it is essentially free (about 90 - 95% pure at 

least) of non-nucleotide material naturally associated with it, and thus is distinguished 
from isolated chromosomes. 

By the use of the term "enriched" in reference to nucleic acid is meant that the 
specific DNA or RNA sequence constitutes a significantly higher fraction (2 - 5 fold) of 

20 the total DNA or RNA present in the cells or solution of interest than in normal or 

diseased cells or in the cells from which the sequence was taken. This could be caused by 
a person by preferential reduction in the amount of other DNA or RNA present, or by a 
preferential increase in the amount of the specific DNA or RNA sequence, or by a 
combination of the two. However, it should be noted that enriched does not imply that 

25 there are no other DNA or RNA sequences present, just that the relative amount of the 

sequence of interest has been significantly increased. The term "significant" is used to 
indicate that the level of increase is useful to the person making such an increase, and 
generally means an increase relative to other nucleic acids of about at least 2 fold, more 
preferably at least 5 to 1 0 fold or even more. The term also does not imply that there is no 

30 DNA or RNA from other sources. The other source DNA may, for example, comprise 

DNA from a yeast or bacterial genome, or a cloning vector such as pUCl 9. This term 
distinguishes from naturally occurring events, such as viral infection, or tumor type 
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growths, in which the level of one mRNA may be naturally increased relative to other 
species of mRNA. That is, the term is meant to cover only those situations in which a 
person has intervened to elevate the proportion of the desired nucleic acid. 

It is also advantageous for some purposes that a nucleotide sequence be in purified 
5 form. The term "purified" in reference to nucleic acid does not require absolute purity 

(such as a homogeneous preparation). Instead, it represents an indication that the 
sequence is relatively more pure than in the natural environment (compared to the natural 
level this level should be at least 2-5 fold greater, e.g., in terms of mg/mL). Individual 
clones isolated from a cDNA library may be purified to electrophoretic homogeneity. The 

10 claimed DNA molecules obtained from these clones could be obtained directly from total 

DNA or from total RNA. The cDNA clones are not naturally occurring, but rather are 
preferably obtained via manipulation of a partially purified naturally occurring substance 
(messenger RNA). The construction of a cDNA library from mRNA involves the creation 
of a synthetic substance (cDNA) and pure individual cDNA clones can be isolated from 

3 5 the synthetic library by clonal selection of the cells carrying the cDNA library. Thus, the 

process which includes the construction of a cDNA library from mRNA and isolation of 
distinct cDNA clones yields an approximately 10 6 -fold purification of the native message. 
Thus, purification of at least one order of magnitude, preferably two or three orders, and 
more preferably four or five orders of magnitude is expressly contemplated. 

20 By a "kinase polypeptide" is meant 10 (preferably 20, more preferably 40, most 

preferably 75) or more contiguous amino acids set forth in an amino acid sequence 
selected from the group consisting of those set forth in SEQ ID NO:122, SEQ ID NO:123, 
SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID NO:127, SEQ ID NO:128, 
SEQ ID NO: 129, SEQ ID NO: 130, SEQ ID NO: 131, SEQ ID NO: 132, SEQ ID NO: 133, 

25 SEQ ID NO: 134, SEQ ID NO: 135, SEQ ID NO:l 36, SEQ ID NO:l 37, SEQ ID NO: 13 8, 

SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141, SEQ ID NO:142, SEQ ID NO:143, 
SEQ ID NO.T44, SEQ ID NO:145, SEQ ID NO:146, SEQ ID NO:147, SEQ ID NO:148, 
SEQ ID NO: 1 49, SEQ ED NO: 1 50, SEQ ID NO: 1 5 1 , SEQ ID NO: 1 52, SEQ ID NO: 1 53, 
SEQ ID NO: 1 54, SEQ ID NO: 1 55, SEQ ID NO: 1 56, SEQ ID NO: 1 57, SEQ ID NO: 1 58, 

30 SEQ ID NO: 1 59, SEQ ID NO: 1 60, SEQ ID NO: 1 61 , SEQ ID NO: 1 62, SEQ ID NO: 1 63, 
SEQ ID NO: 1 64, SEQ ID NO: 1 65 . SEQ ED NO: 1 66, SEQ ID NO: 1 67, SEQ ID NO: 1 68, 
SEQ ID NO: 169, SEQ ID NO:l 70, SEQ ID NO: 171, SEQ ID NO: 172, SEQ ID NO: 173, 
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SEQ ID NO: 1 74, SEQ ED NO: 1 75, SEQ ID NO: 1 76, SEQ ID NO: 1 77, SEQ ID NO: 178, 
SEQ ID NO: 1 79, SEQ ID NO: 1 80, SEQ ID NO: 1 8 1 , SEQ ID NO: 1 82, SEQ ID NO: 1 83, 
SEQ ID NO: 1 84, SEQ ED NO: 1 85, SEQ ID NO: 1 86, SEQ ID NO: 1 87, SEQ ID NO: 1 88, 
SEQ ED NO: 1 89, SEQ ID NO: 1 90, SEQ ID NO: 1 91 , SEQ ID NO: 1 99, SEQ ID NO: 1 93, 
5 SEQ ED NO: 1 94, SEQ ED NO: 1 95, SEQ ED NO: 1 96, SEQ ED NO: 1 97, SEQ ID NO: 1 98, 
SEQ ED NO: 1 99, SEQ ED NO:200, SEQ ED NO:201 , SEQ ED NO:202, SEQ ED NO:203, 
SEQ ED NO:204, SEQ ED NO.205, SEQ ED NO:206, SEQ ED NO:207, SEQ ED NO:208, 
SEQ ED NO:209, SEQ ED NO:210, SEQ ED NO:211, SEQ ED NO:212, SEQ ED NO:213, 
SEQEDNO:214, SEQ ED NO:215, SEQ ED NO:216, SEQ ED NO:217, SEQ ED NO:21 8, 

10 SEQ ED NO:219, SEQ ID NO:220, SEQ EDNO:221, SEQ ED NO:222, SEQ ID NO:223, 

SEQ ED NO:224, SEQ ED NO:225, SEQ ED NO:226, SEQ ID NO:227, SEQ ID NO:228, 
SEQ ID NO:229, SEQ ED NO:230, SEQ ED NO:231 , SEQ ID NO:232, SEQ ID NO:233, 
SEQ ED NO:234, SEQ ED NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, 
SEQ ED NO:239, SEQ ID NO:240, SEQ ID NO:241 , and SEQ ID NO:242, or functional 

15 derivatives thereof as described herein. For sequences for which the full-length sequence 

is not given, the remaining sequences can be determined using methods well-known to 
those in the art and are intended to be included in the invention. In certain aspects, 
polypeptides of 1 00, 200, 300 or more amino acids are preferred. The kinase polypeptide 
can be encoded by a full-length nucleic acid sequence or any portion of the full-length 

20 nucleic acid sequence, so long as a functional activity of the polypeptide is retained. By 
"functional" domain is meant any region of the polypeptide that may play a regulatory or 
catalytic role as predicted from amino acid sequence homology to other proteins or by the 
presence of amino acid sequences that may give rise to specific structural conformations 
(i.e., coiled-coils). For some purposes, polypeptide domains are preferred, including, but 

25 not limited to, N-terminal, catalytic/kinase and C-terminal. 

The amino acid sequence will be substantially similar to a sequence selected from 
the group consisting of those set forth in SEQ ED NO:122, SEQ ED NO:123, SEQ ID 
NO:124, SEQ ED NO:125, SEQ ED NO:126, SEQ ED NO:127, SEQ ED NO:128, SEQ ID 
NO:129, SEQ ED NO: 130, SEQ ED NO:131, SEQ ID NO: 132, SEQ ID NO: 133, SEQ ID 

30 NO: 134, SEQ ED NO: 135, SEQ ID NO: 136, SEQ ID NO: 137, SEQ ID NO: 138, SEQ ID 
NO: 1 39, SEQ ED NO: 1 40, SEQ ID NO: 1 4 1 , SEQ ID NO: 1 42, SEQ ID NO: 1 43 , SEQ ID 
NO: 1 44, SEQ ED NO: 1 45, SEQ ED NO: 1 46, SEQ ED NO: 1 47, SEQ ED NO: 148, SEQ ID 
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NO:149, SEQ ID NO:150, SEQ ID NO:151, SEQ ID NO:152, SEQ ID NO:153, SEQ ID 
NO:154, SEQ ID NO:155, SEQ ID NO:156, SEQ ID NO:157, SEQ ID NO:158, SEQ ID 
NO: 159, SEQ ID NO: 1 60, SEQ ID NO: 1 61 , SEQ ID NO: 1 62, SEQ ID NO: 163, SEQ ID 
NO: 164, SEQ ID NO: 165. SEQ ID NO: 166, SEQ ID NO: 167, SEQ ID NO: 168, SEQ ID 
NO: 169, SEQ ID NO: 170, SEQ ID NO: 171, SEQ ID NO: 172, SEQ ID NO: 173, SEQ ID 
NO: 174, SEQ ID NO: 175, SEQ ID NO: 176, SEQ ID NO: 177, SEQ ED NO: 178, SEQ ID 
NO: 179, SEQ ID NO: 180, SEQ ID NO: 181, SEQ ID NO: 182, SEQ ID NO: 183, SEQ ID 
NO: 1 84, SEQ ID NO: 1 85, SEQ ID NO: 1 86, SEQ ID NO: 1 87, SEQ ID NO: 1 88, SEQ ID 
-NO:.l 89, SEQ ID NO: 1 90, SEQ ID NO: 1 91 , SEQ ID NO: 1 99, SEQ ID NO: 1 93, SEQ ID 
NO: 1 94, SEQ ID NO: 1 95 , SEQ ID NO: 1 96, SEQ ID NO: 1 97, SEQ ID NO: 1 98, SEQ ID 
NO:199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID 
NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID 
NO:209, SEQ ID NO:210, SEQ ID NO:21 1, SEQ ID NO:212, SEQ ID NO:213, SEQ ID 
NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID NO:218, SEQ ID 
NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ ID 
NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID 
NO:229 ; SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID NO:233, SEQ ID 
NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID 
NO:239, SEQ ID NO:240, SEQ ID NO:241, and SEQ ID NO:242, or the corresponding 
full-length amino acid sequence, or fragments thereof. A sequence that is substantially 
similar to a sequence selected from the group consisting of those set forth in SEQ ID 
NO: 1 22, SEQ ID NO: 1 23, SEQ ID NO: 1 24, SEQ ID NO: 1 25, SEQ ID NO: 1 26, SEQ ID 
NO:127, SEQ IDNO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID 
NO:132, SEQ IDNO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO: 136, SEQ ID 
NO: 137, SEQ ID NO: 138, SEQ ID NO: 139, SEQ ID NO: 140, SEQ ID NO: 141, SEQ ID 
NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146, SEQ ID 
NO: 1 47, SEQ ID NO: 1 48, SEQ ID NO: 1 49, SEQ ID NO: 1 50, SEQ ID NO: 1 5 1 , SEQ ID 
NO: 152, SEQ ID NO: 153, SEQ ID NO: 154, SEQ ID NO: 155, SEQ ID NO: 156, SEQ ID 
NO:l 57, SEQ ID NO: 1 58, SEQ ID NO:l 59, SEQ ID NO: 1 60, SEQ ID NO: 1 61 , SEQ ID 
NO: 162, SEQ ID NO: 163, SEQ ID NO: 164, SEQ ID NO: 165. SEQ ID NO: 166, SEQ ID 
NO: 1 67, SEQ ID NO: 1 68, SEQ ID NO: 1 69, SEQ ID NO: 1 70, SEQ ID NO: 1 7 1 , SEQ ID 
NO:l 72, SEQ ID NO: 1 73, SEQ ID NO: 1 74, SEQ ID NO:l 75, SEQ ID NO: 1 76, SEQ ID 
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NO: 1 77, SEQ ID NO: 178, SEQ ID NO: 1 79, SEQ ID NO: 1 80, SEQ ID NO: 181, SEQ ID 
NO: 1 82, SEQ ID NO: 1 83, SEQ ID NO: 1 84, SEQ ID NO: 1 85, SEQ ID NO: 1 86, SEQ ID 
NO: 187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO: 190, SEQ ID NO: 191, SEQ ID 
NO: 199, SEQ ID NO: 193, SEQ ID NO: 194, SEQ ID NO: 195, SEQ ID NO: 196, SEQ ID 
5 NO: 197, SEQ ID NO: 198, SEQ ID NO: 199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID 
NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID 
NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:21 1, SEQ ID 
NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID 
NO:217, SEQ ID NO:21 8, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:22 1 , SEQ ID 

1 0 NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID 
NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:23 1 , SEQ ID 
NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID 
NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241, and SEQ 
ID NO:242 will have at least 75% identity (preferably 90%, more preferably at least 95% 

1 5 and most preferably 99-1 00%) to a sequence selected from the group consisting of those 

set forth in SEQ ID NO: 122, SEQ ID NO: 123, SEQ ID NO: 124, SEQ ID NO: 125, SEQ 
ID NO:126, SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ 
ID NO: 1 3 1 , SEQ ID NO: 1 32, SEQ ID NO: 1 33, SEQ ID NO: 1 34, SEQ ID NO: 1 35, SEQ 
ID NO: 1 36, SEQ ID NO: 137, SEQ ID NO: 138, SEQ ID NO: 1 39, SEQ ID NO: 1 40, SEQ 

20 ID NO: 141, SEQ ID NO: 142, SEQ ID NO: 143, SEQ ID NO: 144, SEQ ID NO: 145, SEQ 
ID NO: 1 46, SEQ ID NO: 1 47, SEQ ID NO: 1 48, SEQ ID NO: 1 49, SEQ ID NO: 1 50, SEQ 
IDNO:151,SEQIDNO:152, SEQIDNO:153, SEQIDNO:154, SEQIDNO:155, SEQ 
ID NO: 1 56, SEQ ID NO:l 57, SEQ ID NO: 1 58, SEQ ED NO: 159, SEQ ID NO: 1 60, SEQ 
ID NO:l 61 , SEQ ID NO: 1 62, SEQ ID NO: 1 63, SEQ ID NO: 1 64, SEQ ID NO: 1 65. SEQ 

25 ID NO: 1 66, SEQ ID NO: 1 67, SEQ ID NO: 1 68, SEQ ID NO: 1 69, SEQ ID NO: 1 70, SEQ 

ID NO: 171, SEQ ID NO: 172, SEQ ID NO: 173, SEQ ID NO: 174, SEQ ID NO: 175, SEQ 
ID NO: 1 76, SEQ ED NO: 1 77, SEQ ED NO: 1 78, SEQ ID NO: 1 79, SEQ ID NO: 1 80, SEQ 
ED NO: 1 8 1 , SEQ ED NO: 1 82, SEQ ED NO: 1 83, SEQ ED NO: 1 84, SEQ ID NO: 1 85, SEQ 
ID NO: 1 86, SEQ ED NO: 1 87, SEQ ID NO: 1 88, SEQ ID NO: 1 89, SEQ ID NO: 1 90, SEQ 

30 ID NO:191, SEQ ED NO:199, SEQ ED NO:193, SEQ ED NO:194, SEQ ID NO:195, SEQ 
ID NO: 196, SEQ ED NO: 197, SEQ ED NO: 198, SEQ ID NO: 199, SEQ ID NO:200, SEQ 
ED NO:201, SEQ ED NO:202, SEQ ED NO:203, SEQ ID NO:204, SEQ ED NO:205, SEQ 
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ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ 
ID NO:21 1, SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID N0:21 5, SEQ 
ID NO:216, SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ 
ID NO:22 1 , SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224 3 SEQ ID NO:225, SEQ 
5 ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ 
ID NO:231, SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ 
ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239 5 SEQ ID NO:240, SEQ 
ID NO:241, and SEQ ID NO:242 or portions of or the entire corresponding full-length 
amino acid sequences. 

10 By "identity" is meant a property of sequences that measures their similarity or 

relationship. Identity is measured by dividing the number of identical residues between 
two sequences (either full-length or a defined domain) by the total number of residues in 
the known sequence, or the domain of the known sequence, and multiplying the product 
by 100. Thus, two copies of exactly the same sequence have 100% identity, but sequences 

15 that are less highly conserved, and have replacements and substitutions, have a lower 

degree of identity. "Gaps" are spaces in an alignment that can result from aligning a novel 
sequence with a known sequence when the novel sequence has additions or deletions of 
amino acids in comparison with the known sequence. These gaps do not factor into the 
assessment of % identity using the sbove calculation. 

20 Those skilled in the art will recognize that several computer programs are also 

available for determining sequence identity using standard parameters, for example, Blast 
(Altschul, et al (1997) Nucleic Acids Res. 25:3389-3402), Blast2 (Altschul, et al (1990) 
J. Mol. Biol. 215:403-410), and Smith-Waterman (Smith, et al (1981) J. Mol. Biol. 
147:195-197). 

25 In preferred embodiments, the invention features isolated, enriched, or purified 

nucleic acid molecules encoding a kinase polypeptide comprising a nucleotide sequence 
that: (a) encodes a polypeptide having an amino acid sequence selected from the group 
consisting of those set forth in SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ 
ID NO:125, SEQ ID NO:126, SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ 

30 ID NO:130, SEQ ID NO:131, SEQ ED NO:132, SEQ ED NO:133, SEQ ID NO:134, SEQ 
ID NO:135, SEQ ID NO:136, SEQ ED NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ 
ID NO: 140, SEQ ID NO: 141, SEQ ED NO: 142, SEQ ID NO: 143, SEQ ID NO: 144, SEQ 
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20 ID NO:240, SEQ ED NO:24 1 , and SEQ ED NO:242, or the corresponding full-length 

amino acid sequence, or fragments thereof. A sequence that is substantially similar to a 
sequence selected from the group consisting of those set forth in SEQ ID NO: 122, SEQ ID 
NO:123, SEQ ED NO:124, SEQ ED NO:125, SEQ ID NO:126, SEQ ID NO:127, SEQ ID 
NO: 128, SEQ ED NO: 129, SEQ ED NO: 130, SEQ ID NO:131, SEQ ID NO: 132, SEQ ID 

25 NO:133, SEQ EDNO:134, SEQ ED NO:135, SEQ ID NO:136, SEQ ID NO:137, SEQ ID 

NO: 138, SEQ ID NO: 139, SEQ ED NO: 140, SEQ ID NO: 141, SEQ ID NO: 142, SEQ ID 
NO:143, SEQ ID NO:144, SEQ ED NO:145, SEQ ED NO:146, SEQ ID NO:147, SEQ ID 
NO: 148, SEQ ED NO: 149, SEQ ED NO: 150, SEQ ID NO:151, SEQ ID NO: 152, SEQ ID 
NO: 1 53, SEQ ID NO: 1 54, SEQ ID NO: 1 55, SEQ ID NO: 1 56, SEQ ID NO: 1 57, SEQ ID 

30 NO:158, SEQ ED NO:159, SEQ ID NO:160, SEQ ID NO:161, SEQ ID NO:162, SEQ ID 
NO: 1 63, SEQ ID NO: 1 64, SEQ ED NO: 1 65. SEQ ED NO: 1 66, SEQ ID NO: 1 67, SEQ ID 
NO: 168, SEQ ID NO:169, SEQ ID NO: 170, SEQ ED NO: 171, SEQ ED NO: 172, SEQ ID 
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NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176, SEQ ID NO:177, SEQ ID 
NO: 178, SEQ ID NO: 179, SEQ ID NO: 180, SEQ ID NO: 181, SEQ ID NO: 182, SEQ ID 
NO: 183, SEQ ID NO: 1 84, SEQ ID NO: 1 85, SEQ ID NO: 1 86, SEQ ID NO : 1 87, SEQ ID 
NO: 188, SEQ ID NO:l 89, SEQ ID NO: 190, SEQ ID NO: 191, SEQ ID NO: 199, SEQ ID 
5 NO: 193, SEQ ED NO: 194, SEQ ID NO: 195, SEQ ID NO: 196, SEQ ID NO: 197, SEQ ID 
NO: 198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID 
NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID 
NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:21 1, SEQ ID NO:212, SEQ ID 
NO:213, SEQ ID NO:214, SEQ ED NO:215, SEQ ID NO:21 6, SEQ ID NO:21 7, SEQ ID 

10 NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID 
NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID 
NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID 
NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ED 
NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ED NO:241, and SEQ ID NO:242 will 

] 5 have at least 75% identity (preferably 90%, more preferably at least 95% and most 



preferably 99-1 00%) to the sequence selected from the group consisting of those set forth 
in SEQ ED NO: 1 22, SEQ ID NO: 1 23, SEQ ED NO: 1 24, SEQ ID NO: 1 25, SEQ ID 
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NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID 
NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ED NO:209, SEQ ID NO:210, SEQ ID 
N0:21 1, SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID 
NO:216, SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ IDNO:220, SEQ ID 
5 NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ ED NO:224, SEQ ID NO:225, SEQ ID 
NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID 
NO:231, SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID 
NO:236, SEQ ED NO:237, SEQ ED NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID 
NO:241, and SEQ ED NO:242; (b) is the complement of the nucleotide sequence of (a); (c) 

1 0 hybridizes under highly stringent conditions to the nucleotide molecule of (a) and encodes 
a naturally occurring kinase polypeptide; (d) encodes a kinase polypeptide having an 
amino acid sequence selected from the group consisting of those set forth in SEQ ID 
NO: 122, SEQ ED NO: 123, SEQ ID NO: 124, SEQ ID NO: 125, SEQ ID NO: 126, SEQ ID 
NO: 1 27, SEQ ED NO: 1 28, SEQ ED NO: 1 29, SEQ ED NO: 1 30, SEQ ID NO: 1 3 1 , SEQ ID 

15 NO: 132, SEQ ED NO: 133, SEQ ED NO: 134, SEQ ED NO: 135, SEQ ID NO: 136, SEQ ID 

NO:137, SEQ EDNO:138, SEQ ED NO:139, SEQ ED NO:140, SEQ ED NO:141, SEQ ID 
NO: 142, SEQ ED NO: 143, SEQ ED NO: 144, SEQ ED NO: 145, SEQ ED NO: 146, SEQ ID 
NO: 1 47, SEQ ED NO: 1 48, SEQ ED NO: 1 49, SEQ ED NO: 1 50, SEQ ED NO: 1 5 1 , SEQ ID 
NO:l 52, SEQ ED NO: 153, SEQ ED NO: 154, SEQ ID NO: 155, SEQ ID NO: 156, SEQ ID 

20 NO: 1 57, SEQ ID NO: 1 58, SEQ ID NO: 1 59, SEQ ID NO: 1 60, SEQ ID NO: 1 61 , SEQ ID 
NO: 162, SEQ ED NO: 163, SEQ ED NO: 164, SEQ ID NO: 165. SEQ ID NO: 166, SEQ ID 
NO: 167, SEQ ID NO: 168, SEQ ID NO: 169, SEQ ID NO: 170, SEQ ID NO: 171, SEQ ID 
NO:172, SEQ ID NO: 173, SEQ ED NO: 174, SEQ ED NO: 175, SEQ ED NO: 176, SEQ ID 
NO: 1 77, SEQ ID NO: 1 78, SEQ ED NO: 1 79, SEQ ED NO: 1 80, SEQ ED NO: 1 8 1 , SEQ ID 

25 NO: 182, SEQ ID NO: 183, SEQ ID NO: 184, SEQ ID NO: 185, SEQ ID NO: 186, SEQ ID 
NO:187, SEQ ED NO: 188, SEQ ED NO: 189, SEQ ED NO: 190, SEQ ID NO: 191, SEQ ID 
NO:199, SEQ ED NO: 193, SEQ ED NO: 194, SEQ ED NO: 195, SEQ ID NO: 196, SEQ ID 
NO: 197, SEQ ED NO: 198, SEQ ED NO: 199, SEQ ED NO:200, SEQ ED NO:201, SEQ ID 
NO:202, SEQ ED NO:203, SEQ ED NO:204, SEQ ID NO:205, SEQ ED NO:206, SEQ ED 

30 NO:207, SEQ ED NO:208, SEQ ED NO:209, SEQ ED NO:210, SEQ ED NO:21 1, SEQ ED 
NO:212, SEQ ID NO:213, SEQ ED NO:214, SEQ ED NO:215, SEQ ID NO:216, SEQ ID 
NO:217, SEQ EDNO:218, SEQ ED NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID 
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NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID 
NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:23 1 , SEQ ID 
NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID 
NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241 , and SEQ 
5 ID NO:242, or the corresponding full-length amino acid sequence, or fragments thereof. 

A sequence that is substantially similar to a sequence selected from the group consisting of 
those set forth in SEQ ID NO: 122, SEQ ID NO: 123, SEQ ID NO: 124, SEQ ID NO: 125, 
SEQ ID NO:126, SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, 
SEQ ID NO: 131, SEQ ID NO: 132, SEQ ID NO: 133, SEQ ID NO:134, SEQ ID NO: 135, 

10 SEQ ID NO: 1 36, SEQ ID NO: 1 37, SEQ ID NO: 1 38, SEQ ID NO: 1 39, SEQ ID NO: 1 40, 

SEQIDNO:141,SEQIDNO:142, SEQIDNO:143, SEQ ID NO:144, SEQIDNO:145, 
SEQ ID NO: 146, SEQ ID NO: 147, SEQ ID NO: 148, SEQ ID NO: 149, SEQ ID NO: 150, 
SEQ ID NO: 151, SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO: 1.55, 
SEQ ID NO: 1 56, SEQ ID NO: 157, SEQ ID NO:l 58, SEQ ID NO: 1 59, SEQ ID NO: 1 60, 

15 SEQ ID NO: 1 61 , SEQ ID NO: 1 62, SEQ ID NO: 1 63, SEQ ID NO: 1 64, SEQ ID NO: 1 65. 

SEQ ID NO : 1 66, SEQ ID NO: 1 67, SEQ ID NO: 1 68, SEQ ID NO: 1 69, SEQ ID NO: 1 70, 
SEQ ID NO: 1 71 , SEQ ID NO: 1 72, SEQ ID NO: 1 73, SEQ ID NO: 1 74, SEQ ID NO: 175, 
SEQ ID NO: 1 76, SEQ ID NO:l 77, SEQ ID NO: 1 78, SEQ ID NO: 1 79, SEQ ID NO: 1 80, 
SEQ ID NO: 1 8 1 , SEQ ID NO: 1 82, SEQ ID NO: 1 83, SEQ ID NO: 1 84, SEQ ID NO: 1 85, 

20 SEQ ID N0:186, SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, 

SEQ ID NO: 1 91 , SEQ ID NO: 1 99, SEQ ID NO: 1 93, SEQ ID NO: 1 94, SEQ ID NO: 1 95, 
SEQ ID NO: 1 96, SEQ ID NO: 1 97, SEQ ID NO: 1 98, SEQ ID NO: 1 99, SEQ ID NO:200, 
SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO.203, SEQ ID NO:204, SEQ ID NO:205, 
SEQ ID NO:206, SEQ ED NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, 

25 SEQ ID NO:21 1, SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, 
SEQ ID NO:2 1 6, SEQ ID NO:2 1 7, SEQ ID NO:2 1 8, SEQ ID NO:2 1 9, SEQ ID NO:220, 
SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, 
SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, 
SEQ ID NO:231, SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, 

30 SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, 
SEQ ID NO:241, and SEQ ED NO:242 will have at least 75% identity (preferably 90%, 
more preferably at least 95% and most preferably 99-100%) to the sequence of SEQ ID 



WO 00/73469 



15 



PCT/US00/14842 



NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ED NO:125, SEQ ED NO:126, SEQ ID 
NO:127, SEQ ED NO:128, SEQ ED NO:129, SEQ ED NO:130, SEQ ID N0:131, SEQ ID 
NO:132, SEQ ID NO:133, SEQ ED NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID 
NO: 137, SEQ ID NO: 138, SEQ ED NO: 139, SEQ ID NO: 140, SEQ ID NO: 141, SEQ ID 
5 NO: 142, SEQ ID NO: 143, SEQ ID NO: 144, SEQ ID NO: 145, SEQ ID NO: 146, SEQ ID 
NO: 147, SEQ ID NO: 148, SEQ ID NO: 149, SEQ ID NO: 150, SEQ ID NO:151, SEQ ID 
NO:152, SEQ ID NO:153, SEQ ID NO: 154, SEQ ID NO: 155, SEQ ID NO: 156, SEQ ID 
NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ED NO:161, SEQ ID 
NO: 162, SEQ ID NO: 163, SEQ ID NO: 164, SEQ ID NO: 165. SEQ ID NO: 166, SEQ ID 

10 NO: 1 67, SEQ ED NO: 1 68, SEQ ED NO: 1 69, SEQ ID NO: 1 70, SEQ ID NO: 1 7 1 , SEQ ID 

NO: 1 72, SEQ ID NO: 1 73, SEQ ED NO: 1 74, SEQ ED NO: 1 75, SEQ ID NO: 1 76, SEQ ID 
NO: 1 77, SEQ ID NO: 1 78, SEQ ED NO: 1 79, SEQ ED NO: 1 80, SEQ ED NO: 1 8 1 , SEQ ED 
NO: 1 82, SEQ ED NO: 1 83, SEQ ED NO: 1 84, SEQ ED NO: 1 85, SEQ ED NO: 1 86, SEQ ID 
NO: 1 87, SEQ ID NO: 1 88, SEQ ED NO: 1 89, SEQ ED NO: 1 90, SEQ ED NO: 1 9 1 , SEQ ED 

15 NO:199, SEQ EDNO:193, SEQ ED NO:194, SEQ ED NO:195, SEQ ID NO:196, SEQ ID 

NO:197, SEQ ID NO:198, SEQ ED NO:199, SEQ ED NO:200, SEQ ID NO:201, SEQ ID 
NO:202, SEQ ED NO:203, SEQ ID NO:204, SEQ ED NO:205, SEQ ID NO:206, SEQ ID 
NO:207, SEQ ED NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:21 1, SEQ ID 
NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID 

20 NO:217, SEQ ID NO:218, SEQ ED NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID 
NO:222, SEQ ED NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID 
NO:227, SEQ ED NO:228, SEQ ED NO:229, SEQ ED NO:230, SEQ ID NO:231, SEQ ID 
NO:232, SEQ ED NO:233, SEQ ED NO:234, SEQ ED NO:235, SEQ ID NO:236, SEQ ID 
NO:237, SEQ ED NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241, and SEQ 

25 ED NO:242, except that it lacks one or more, but not all, of a domain selected from the 

group consisting of an N-terminal domain, a catalytic domain, a C-terminal domain, a 
coiled-coil structure region, a proline-rich region, a spacer region, an insert, and a C- 
terminal tail; (e) is the complement of the nucleotide sequence of (d); (f) encodes a 
polypeptide having an amino acid sequence selected from the group consisting of those set 

30 forth in SEQ ID NO: 122, SEQ ID NO: 123, SEQ ID NO: 124, SEQ ID NO: 125, SEQ ID 
NO: 126, SEQ ID NO: 127, SEQ ID NO: 128, SEQ ID NO: 129, SEQ ID NO: 130, SEQ ID 
NO:131, SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID 
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NO: 136, SEQ ID NO: 137, SEQ ID NO: 138, SEQ ID NO: 139, SEQ ID NO: 140, SEQ ID 
NO:141, SEQ ID N0.142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID 
NO: 146, SEQ ID NO: 147, SEQ ID NO: 148, SEQ ID NO: 149, SEQ ID NO: 150, SEQ ID 
NO:151, SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID 
5 NO: 1 56, SEQ ID NO: 1 57, SEQ ID NO: 1 58, SEQ ED NO: 1 59, SEQ ID NO: 1 60, SEQ ID 
NO:161, SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165. SEQ ID 
NO: 166, SEQ ED NO: 167, SEQ EDNO:168, SEQEDNO:169, SEQ ED NO: 170, SEQ ED 
NO: 171, SEQ ED NO: 172, SEQ ID NO: 173, SEQ ED NO: 174, SEQ ID NO: 175, SEQ ID 
NO: 1 76, SEQ ED NO: 1 77, SEQ ID NO: 1 78, SEQ ID NO: 1 79, SEQ ID NO: 1 80, SEQ ID 

10 NO: 1 8 1 , SEQ ED NO: 1 82, SEQ ID NO: 1 83, SEQ ID NO: 1 84, SEQ ID NO: 1 85, SEQ ID 

NO: 1 86, SEQ ID NO: 1 87, SEQ ID NO: 1 88, SEQ ED NO: 1 89, SEQ ID NO: 1 90, SEQ ID 
NO: 191, SEQ ID NO: 199, SEQ ID NO: 193, SEQ ID NO: 194, SEQ ID NO: 195, SEQ ID 
NO: 196, SEQ ED NO: 197, SEQ ED NO: 198, SEQ ID NO: 199, SEQ ID NO:200, SEQ ID 
NO:201 , SEQ ID NO:202, SEQ ED NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID 

1 5 NO:206, SEQ ID NO:207, SEQ ED NO:208, SEQ ID NO:209, SEQ ID NO:2 1 0, SEQ ID 

NO:2 1 1 , SEQ ED NO:2 1 2, SEQ ED NO:2 1 3, SEQ ED NO:2 1 4, SEQ ID NO:2 1 5, SEQ ID 
NO:216, SEQ ED NO:217, SEQ ED NO:218, SEQ ID NO:219, SEQ ED NO:220, SEQ ID 
NO:221 , SEQ ED NO:222, SEQ ED NO:223, SEQ ED NO:224, SEQ ED NO:225, SEQ ID 
NO:226, SEQ ID N0.227, SEQ ED NO:228, SEQ ED NO:229, SEQ ID NO.230, SEQ ID 

20 NO:23 1 , SEQ ED NO:232, SEQ ED NO:233, SEQ ED NO:234, SEQ ED NO:235, SEQ ED 
NO:236, SEQ ID NO:237, SEQ ED NO:238, SEQ ID NO:239, SEQ ED NO:240, SEQ ID 
NO:241, and SEQ ED NO:242, or the corresponding full-length amino acid sequence, or 
fragments thereof. (The domain demarcations of the polypeptides of the invention are 
indicated in Table 2 by reference to the kinase domain.) A sequence that is substantially 

25 similar to a sequence selected from the group consisting of those set forth in SEQ ID 

NO: 1 22, SEQ ED NO: 1 23, SEQ ED NO: 1 24, SEQ ED NO: 125, SEQ ID NO: 1 26, SEQ ID 
NO: 1 27, SEQ ID NO: 1 28, SEQ ID NO: 1 29, SEQ ID NO: 1 30, SEQ ID NO: 131, SEQ ID 
NO: 1 32, SEQ ED NO: 1 33, SEQ ED NO: 1 34, SEQ ED NO.T 35, SEQ ID NO: 1 36, SEQ ID 
NO: 137, SEQ ED NO: 138, SEQ ED NO: 139, SEQ ED NO: 140, SEQ ID NO: 141, SEQ ID 

30 NO: 142, SEQ ED NO: 143, SEQ ED NO: 144, SEQ ED NO: 145, SEQ ID NO: 146, SEQ ED 
NO:147, SEQ ED NO:148, SEQ ED NO:149, SEQ ED NO:150, SEQ ED NO:151, SEQ ED 
NO: 152, SEQ ED NO: 153, SEQ ED NO: 154, SEQ ED NO: 155, SEQ ED NOT 56, SEQ ID 
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NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161, SEQ ID 
NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165. SEQ ID NO:166, SEQ ID 
NO: 1 67, SEQ ID NO: 1 68, SEQ ID NO: 1 69, SEQ ID NO: 1 70, SEQ ID NO: 1 7 1 , SEQ ID 
NO: 172, SEQ ID NO: 173, SEQ ID NO: 174, SEQ ID NO: 175, SEQ ID NO: 176, SEQ ID 
5 NO: 1 77, SEQ ID NO: 1 78, SEQ ID NO: 1 79, SEQ ID NO: 1 80, SEQ ID NO: 1 8 1 , SEQ ID 

NO:l 82, SEQ ID NO: 183, SEQ ID NO: 184, SEQ ID NO: 185, SEQ ID NO: 186, SEQ ID 
NO: 1 87, SEQ ID NO: 188, SEQ ID NO: 1 89, SEQ ID NO: 1 90, SEQ ID NO: 1 91 , SEQ ID 
NO: 199, SEQ ID NO: 193, SEQ ID NO: 194, SEQ ID NO: 195, SEQ ID NO: 196, SEQ ID 
NO:197, SEQ ID NO:198, SEQ ID NO: 199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID 

1 0 NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID 
NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:2 1 0, SEQ ED NO:2 1 1 , SEQ ID 
NO:212, SEQ ID NO:213, SEQ ED NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ED 
NO:217, SEQ ID NO:218, SEQ ED NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID 
NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID 

1 5 NO:227, SEQ ID NO:228, SEQ ED NO:229, SEQ ID NO:230, SEQ ID NO:23 1 , SEQ ID 
NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID 
NO:237, SEQ ED NO:238, SEQ ED NO:239, SEQ ID NO:240, SEQ ID NO:241, and SEQ 
ID NO:242 will have at least 75% identity (preferably 90%, more preferably at least 95% 
and most preferably 99-100%) to the sequence selected from the group consisting of those 

20 set forth in SEQ ED NO: 122, SEQ ID NO: 123, SEQ ED NO: 124, SEQ ID NO: 125, SEQ 
ED NO: 126, SEQ ED NO: 127, SEQ ED NO: 128, SEQ ED NO: 129, SEQ ID NO: 130, SEQ 
ED NO: 131, SEQ ED NO: 132, SEQ ED NO: 133, SEQ ED NO: 134, SEQ ID NO: 135, SEQ 
ED NO:l 36, SEQ ED NO: 137, SEQ ED NO: 138, SEQ ED NO: 139, SEQ ED NO: 140, SEQ 
ED NO: 14], SEQ ED NO: 142, SEQ ED NO: 143, SEQ ED NO: 144, SEQ ID NO: 145, SEQ 

25 ID NO: 1 46, SEQ ED NO: 1 47, SEQ ED NO: 1 48, SEQ ED NO: 1 49, SEQ ID NO: 1 50, SEQ 
ID NO: 151, SEQ ID NO: 152, SEQ ED NO: 153, SEQ ED NO: 154, SEQ ID NO: 155, SEQ 
ED NO: 1 56, SEQ ID NO: 1 57, SEQ ED NO: 1 58, SEQ ID NO: 1 59, SEQ ID NO: 1 60, SEQ 
ID NO: 161, SEQ ID NO: 162, SEQ ID NO: 163, SEQ ID NO: 164, SEQ ID NO: 165. SEQ 
ED NO: 166, SEQ ED NO: 167, SEQ ID NO: 168, SEQ ED NO: 169, SEQ ED NO: 170, SEQ 

30 ID NO: 171, SEQ ED NO: 172, SEQ ID NO: 173, SEQ ED NO: 174, SEQ ID NO: 175, SEQ 
ID NO: 1 76, SEQ ID NO: 1 77, SEQ ID NO: 1 78, SEQ ID NO: 1 79, SEQ ID NO: 1 80, SEQ 
ID NO:181, SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ 
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ID NO: 186, SEQIDNO:187, SEQ ID NO: 188, SEQ ID NO: 189, SEQ ID NO: 190, SEQ 
ID NO: 191,. SEQ ID NO: 199, SEQ ID NO: 193, SEQ ID NO: 194, SEQ ID NO: 195, SEQ 
ID NO: 196, SEQ ID NO: 197, SEQ ID NO:198, SEQ ID NO: 199, SEQ ID NO:200, SEQ 
ID NO:201 , SEQ ID NO:202, SEQ ID NO:203, SEQ ED NO:204, SEQ ID NO:205, SEQ 

5 ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:21 0, SEQ 

ID NO:21 1, SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ 
ID NO:216, SEQ ED NO:217 ; SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ 
ID NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ 
ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ 

1 0 ID NO:231 , SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ 

ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ED NO:239, SEQ ED NO:240, SEQ 
ED NO:241, and SEQ ED NO:242; (b) is the complement of the nucleotide sequence of (a); 
(c) hybridizes under highly stringent conditions to the nucleotide molecule of (a) and 
encodes a naturally occurring kinase polypeptide; (d) encodes a kinase polypeptide having 

1 5 an amino acid sequence selected from the group consisting of those set forth in SEQ ID 



NO: 122, 


SEQ 


EDNO:123, 


SEQ 


ID 


NO: 124, 


SEQ 


ED 


NO:125, 


SEQ 


ED 


NO:126, 


SEQ 


ID 
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NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:21 1 , SEQ ID 
NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID N0:216, SEQ ID 
NO:217, SEQ ID N0:21 8, SEQ ID N0:21 9, SEQ ID NO:220, SEQ ID NO:221 , SEQ ID 
NO:222, SEQ ED NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID 
5 NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID 
NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID 
NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241 , and SEQ 
ID NO:242, or the corresponding full-length amino acid sequence, or fragments thereof. 
A sequence that is substantially similar to a sequence selected from the group consisting of 

10 those set forth in SEQ ID NO: 122, SEQ ID NO: 123, SEQ ID NO: 124, SEQ ID NO: 125, 

SEQ ID NO:126, SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, 
SEQ ID NO:131, SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, 
SEQ ID NO:136, SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, 
SEQIDNO:141, SEQ ID NO: 142, SEQ ID NO: 143, SEQIDNO:144, SEQ IDNO:145, 

15 SEQ ID NO: 1 46, SEQ ID NO: 1 47, SEQ ID NO: 1 48, SEQ ID NO: 1 49, SEQ ID NO: 1 50, 

SEQ ID NO: 1 51 , SEQ ID NO:l 52, SEQ ID NO: 1 53, SEQ ID NO: 1 54, SEQ ID NO: 1 55, 
SEQ ID NO: 156, SEQ ED NO: 157, SEQ ID NO: 158, SEQ ID NO: 159, SEQ ID NO: 160, 
SEQ ID NO: 1 61 , SEQ ID NO: 1 62, SEQ ID NO: 1 63, SEQ ID NO: 1 64, SEQ ID NO: 1 65. 
SEQ ID NO: 166, SEQ ID NO:167, SEQ ID NO: 168, SEQ ID NO: 169, SEQ ID NO: 170, 

20 SEQ ID NO: 171, SEQ ID NO: 172, SEQ ID NO: 173, SEQ ID NO: 174, SEQ ID NO: 175, 

SEQ ID NO: 176, SEQ ID NO:177, SEQ ID NO: 178, SEQ ED NO: 179, SEQ ID NO: 180, 
SEQ ID NO: 1 81 , SEQ ID NO: 1 82, SEQ ID NO: 1 83, SEQ ID NO: 1 84, SEQ ID NO: 1 85, 
SEQ ID NO: 1 86, SEQ ID NO: 1 87, SEQ ED NO: 1 88, SEQ ID NO: 1 89, SEQ ID NO: 1 90, 
SEQ ID NO: 1 91 , SEQ DD NO: 1 99, SEQ ID NO: 1 93, SEQ ID NO: 1 94, SEQ ID NO: 1 95, 

25 SEQ ID NO: 1 96, SEQ ID NO: 1 97, SEQ ID NO: 1 98, SEQ DD NO: 1 99, SEQ ID NO:200, 
SEQ DD NO:201, SEQ DD NO:202, SEQ DD NO:203, SEQ DD NO:204, SEQ DD NO:205, 
SEQ DD NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, 
SEQ ID NO:211, SEQ DD NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, 
SEQ ID NO:216, SEQ DD NO:217, SEQ DD NO:218, SEQ DD N0.219, SEQ DD NO:220, 

30 SEQ DD NO:221, SEQ DD NO:222, SEQ DD NO:223, SEQ ED NO:224, SEQ DD N0:225, 

SEQ DD NO:226, SEQ DD NO:227, SEQ DD NO:228, SEQ DD NO:229, SEQ DD NO:230, 
SEQ DD NO:231, SEQ DD NO:232, SEQ DD NO:233, SEQ DD NO:234, SEQ DD N0.235, 
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SEQ ED NO:236, SEQ ED NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, 
SEQ ED NO:241, and SEQ ID NO:242 will have at least 75% identity (preferably 90%, 
more preferably at least 95% and most preferably 99-100%) to a domain of a polypeptide 
selected from the group consisting of those set forth in SEQ ID NO:122, SEQ ID NO:123, 



5 


SEQ 


ID NO: 124, 


SEQ 


IDNO:125, 


SEQ 


ID 


NO:126, 


SEQ 


IDNO:127, 


SEQ 


ID 


NO: 128, 




SEQ 


ED NO: 129, 


SEQ 


ID NO: 130, 


SEQ 


ID 


NO:131, 


SEQ 


ID NO: 132, 


SEQ 


ID 


NO:133, 




SEQ 


ID NO: 134, 


SEQ 


IDNO:135, 


SEQ 


ED 


NO:136, 


SEQ 


IDNO:137, 


SEQ 


ID 


NO:138, 




SEQ 


ID NO: 139, 


SEQ 


ED NO: 140, 


SEQ 


ED 


NO:141, 


SEQ 


ED NO: 142, 


SEQ 


ID 


NO:143, 




SEQ 


ID NO: 144, 


SEQ 


EDNO:145, 


SEQ 


ID 


NO: 146, 


SEQ 


EDNO:147, 


SEQ 


ID 


NO: 148, 


10 


SEQ 


ID NO: 149, 


SEQ 


ED NO: 150, 


SEQ 


ED 


NO:151, 


SEQ 


ED NO: 152, 


SEQ 


ED 


NO: 153, 




SEQ 


ID NO: 154, 


SEQ 


IDNO:155, 


SEQ 


ED 


NO: 156, 


SEQ 


ED NO: 157, 


SEQ 


ED 


NO:158, 




SEQ 


ID NO: 159, 


SEQ 


ED NO: 160, 


SEQ 


ED 


NO:161, 


SEQ 


ID NO: 162, 


SEQ 


ID 


NO:163, 




SEQ 


ID NO: 164, 


SEQ 


EDNO:165. 


SEQ 


ID 


NO: 166, 


SEQ 


IDNO:167, 


SEQ 


ID 


NO:168, 




SEQ 


ID NO: 169, 


SEQ 


ED NO: 170, 


SEQ 


ID 


NO:171, 


SEQ 


ID NO: 172, 


SEQ 


ID 


NO:173, 


15 


SEQ 


ID NO: 174, 


SEQ 


EDNO:175, 


SEQ 


ID 


NO: 176, 


SEQ 


IDNO:177, 


SEQ 


ID 


NO:178, 




SEQ 


ED NO: 179, 


SEQ 


ID NO: 180, 


SEQ 


ED 


NO:181, 


SEQ 


IDNO:182, 


SEQ 


ID 


NO:183, 




SEQ 


ED NO: 184, 


SEQ 


IDNO:185, 


SEQ 


ID 


NO:186, 


SEQ 


ID NO: 187, 


SEQ 


ID 


NO:188, 




SEQ 


ED NO: 189, 


SEQ 


ED NO: 190, 


SEQ 


ED 


NO:191, 


SEQ 


ID NO: 199, 


SEQ 


ID 


NO: 193, 




SEQ 


ID NO: 194, 


SEQ 


ED NO: 195, 


SEQ 


ID 


NO: 196, 


SEQ 


ID NO: 197, 


SEQ 


ID 


NO: 198, 


20 


SEQ 


ED NO: 199, 


SEQ 


ID NO:200, 


SEQ 


ED 


NO:201, 


SEQ 


ED NO:202, 


SEQ 


ID 


NO:203, 




SEQ 


ED NO:204, 


SEQ 


EDNO:205, 


SEQ 


ED 


NO:206, 


SEQ 


IDNO:207, 


SEQ 


ID 


NO:208, 




SEQ 


ED NO:209, 


SEQ 


EDNO:210, 


SEQ 


ED 


NO:211, 


SEQ 


EDNO:212, 


SEQ 


ID 


NO:213, 




SEQ 


EDNO:214, 


SEQ 


EDNO:215, 


SEQ 


ED 


NO:216, 


SEQ 


IDNO:217, 


SEQ 


ID 


NO:218, 




SEQ 


EDNO:219, 


SEQ 


ED NO:220, 


SEQ 


ED 


NO:221, 


SEQ 


ED NO:222, 


SEQ 


ED 


NO:223, 


25 


SEQ 


ID NO:224, 


SEQ 


IDNO:225, 


SEQ 


ID 


NO:226, 


SEQ 


ED NO:227, 


SEQ 


ED 


NO:228, 




SEQ 


ED NO:229, 


SEQ 


ED NO:230, 


SEQ 


ED 


NO:231, 


SEQ 


EDNO:232, 


SEQ 


ED 


NO:233, 




SEQ 


ED NO:234, 


SEQ 


EDNO:235, 


SEQ 


ID 


NO:236, 


SEQ 


IDNO:237, 


SEQ 


ID 


NO:238, 




SEQ 


EDNO:239, 


SEQ 


ED NO:240, 


SEQ 


ID 


NO:241, 


and SEQ ED NO:242, where the 



domain is selected from the group consisting of an N-terminal domain, a catalytic domain, 



30 a C-terminal domain, a coiled-coil structure region, a proline-rich region, a spacer region, 

an insert, and a C-tenninal tail; (g) is the complement of the nucleotide sequence of (f); (h) 
encodes a polypeptide having an amino acid sequence selected from the group consisting 
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of those set forth in SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID 
NO:125, SEQ ID NO:126, SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID 
NO: 130, SEQ ID NO: 131, SEQ ID NO: 132, SEQ ID NO: 133, SEQ ID NO:134, SEQ ID 
NO: 135, SEQ ID NO:136, SEQ ID NO: 137, SEQ ID NO: 138, SEQ ID NO: 139, SEQ ID 
NO: 140, SEQ ID NO:141, SEQ ID NO: 142, SEQ ID NO: 143, SEQ ID NO: 144, SEQ ID 
NO:145, SEQ ID NO:146, SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID 
NO:150, SEQ ID NO:151, SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID 
NO:l 55, SEQ ID NO:l 56, SEQ ID NO:l 57, SEQ ID NO:l 58, SEQ ID NO:l 59, SEQ ID 
NO: 1 60, SEQ ID NO: 1 61 , SEQ ID NO: 1 62, SEQ ID NO: 1 63, SEQ ID NO: 1 64, SEQ ID 
NO: 165. SEQ ID NO: 166, SEQ ID NO: 167, SEQ ID NO: 168, SEQ ID NO: 169, SEQ ID 
NO:170, SEQ IDNO:171, SEQ IDNO:172, SEQ ID NO: 173, SEQ ID NO: 174, SEQ ID 
NO: 1 75, SEQ ID NO: 1 76, SEQ ID NO: 1 77, SEQ ID NO: 1 78, SEQ ID NO: 1 79, SEQ ID 
NO: 1 80, SEQ ID NO: 1 8 1 , SEQ ID NO: 1 82, SEQ ID NO: 1 83, SEQ ID NO: 1 84, SEQ ID 
NO:185, SEQ ID NO:186, SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID 
NO:190, SEQ ID NO:191, SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID 
NO: 195, SEQ ID NO: 196, SEQ ID NO: 197, SEQ ID NO: 198, SEQ ID NO: 199, SEQ ID 
NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID 
NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID 
NO:2 1 0, SEQ ID NO:2 1 1 , SEQ ID NO:2 1 2, SEQ ID NO:2 1 3, SEQ ID NO:2 1 4, SEQ ID 
NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID 
NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID 
NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID 
NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID 
NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID 
NO:240, SEQ ID NO:241, and SEQ ID NO:242, or the corresponding full-length amino 
acid sequence, or fragments thereof. A sequence that is substantially similar to a sequence 
selected from the group consisting of those set forth in SEQ ID NO: 122, SEQ ID NO: 123, 
SEQ ID NO: 124, SEQ ID NO: 125, SEQ ID NO: 126, SEQ ID NO: 127, SEQ ID NO: 128, 
SEQ ID NO: 129, SEQ ID NO: 130, SEQ ID NO: 131, SEQ ID NO: 132, SEQ ID NO: 133, 
SEQ ID NO: 1 34, SEQ ID NO: 1 35 , SEQ ID NO: 1 36, SEQ ID NO: 1 3 7, SEQ ID NO: 1 3 8, 
SEQ ID NO:139, SEQ IDNO:140, SEQ IDNO:141, SEQ ID NO:142, SEQ ID NO:143, 
SEQ ID NO: 144, SEQ ID NO: 145, SEQ ID NO: 146, SEQ ID NO: 147, SEQ ID NO: 148, 



WO 00/73469 



22 



PCT/US00/14842 



SEQ ID NO:149, SEQ ID NO:150, SEQ ID N0:151, SEQ ID NO:152, SEQ ID NO:153, 
SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156, SEQ ID NO:157, SEQ ID NO:158, 
SEQ ID NO: 159, SEQ ID NO: 160, SEQ ID NO: 161, SEQ ID NO: 162, SEQ ID NO: 163, 
SEQ ID NO: 164, SEQ ID NO: 165. SEQ ID NO: 166, SEQ ID NO:167, SEQ ID NO:168, 
5 SEQ ID NO: 1 69, SEQ ID NO: 1 70, SEQ ID NO: 1 7 1 , SEQ ID NO: 1 72, SEQ ID NO: 1 73, 
SEQ ID NO: 174, SEQ ID NO: 175, SEQ ID NO: 176, SEQ ID NO: 177, SEQ ID NO: 178, 
SEQ ID NO: 179, SEQ ID NO: 180, SEQ ID NO: 181, SEQ ID NO: 182, SEQ ID NO: 183, 
SEQ ED NO: 184, SEQ ID NO: 185, SEQ ID NO: 186, SEQ ID NO:187, SEQ ID NO: 188, 
SEQ ED NO: 189, SEQ ID NO: 190, SEQ ID NO: 191, SEQ ED NO: 199, SEQ ID NO:193, 

10 SEQ ED NO: 1 94, SEQ ID NO: 1 95, SEQ ID NO: 1 96, SEQ ID NO:l 97, SEQ ID NO: 1 98, 

SEQ ED NO: 199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203, 
SEQ ED NO:204, SEQ ID NO:205, SEQ ED NO:206, SEQ ID NO:207, SEQ ID NO:208, 
SEQ ED NO:209, SEQ ID NO:210, SEQ ID NO:21 1, SEQ ID NO:212, SEQ ID NO:21 3, 
SEQ ED NO:214, SEQ ID NO:215, SEQ ED NO:216, SEQ ID NO:217, SEQ ID NO:218, 

1 5 SEQ ED NO:21 9, SEQ ID NO:220, SEQ ID NO:22 1 , SEQ ID NO:222, SEQ ID NO:223, 

SEQ ED NO:224, SEQ ED NO:225, SEQ ED NO:226, SEQ ID NO:227, SEQ ID NO:228, 
SEQ ED NO:229, SEQ ED NO:230, SEQ ED NO:231, SEQ ED NO:232, SEQ ID NO:233, 
SEQ ED NO:234, SEQ ED NO:235, SEQ ED NO:236, SEQ ED NO:237, SEQ ED NO:238, 
SEQ ED NO:239, SEQ ID NO:240, SEQ ED NO:241, and SEQ ID NO:242 will have at 

20 least 75% identity (preferably 90%, more preferably at least 95% and most preferably 99- 
100%) to the sequence selected from the group consisting of those set forth in SEQ ED 
NO: 122, SEQ ED NO: 123, SEQ ED NO: 124, SEQ ID NO: 125, SEQ ED NO: 126, SEQ ID 
NO: 127, SEQ ID NO: 128, SEQ ED NO: 129, SEQ ED NO: 130, SEQ ID NO:131, SEQ ID 
NO: 132, SEQ ID NO: 133, SEQ ID NO: 134, SEQ ID NO: 135, SEQ ID NO: 136, SEQ ID 

25 NO:137, SEQ IDNO:138, SEQ ID N0.139, SEQ ID NO:140, SEQ ED NO:141, SEQ ID 
NO: 1 42, SEQ ED NO: 1 43, SEQ ED NO: 1 44, SEQ ID NO: 1 45, SEQ ID NO: 1 46, SEQ ID 
NO: 147, SEQ ID NO: 148, SEQ ED NO: 149, SEQ ID NO: 150, SEQ ED NO: 151, SEQ ID 
NO:152, SEQ ID NO: 153, SEQ ID NO: 154, SEQ ID NO: 155, SEQ ID NO:156, SEQ ID 
NO: 157, SEQ ED NO: 158, SEQ ED NO: 159, SEQ ID NO: 160, SEQ ID NO: 161, SEQ ID 

30 NO: 162, SEQ ED NO: 163, SEQ ED NO: 164, SEQ ID NO: 165. SEQ ED NO: 166, SEQ ID 
NO: 1 67, SEQ ED NO: 1 68, SEQ ED NO: 1 69, SEQ ED NO: 1 70, SEQ ID NO: 1 7 1 , SEQ ID 
NO: 1 72, SEQ ED NO: 1 73, SEQ ED NO: 1 74, SEQ ED NO: 1 75, SEQ ID NO:l 76, SEQ ID 
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NO: 177, SEQ ID NO: 178, SEQ ID NO: 179, SEQ ID NO: 180, SEQ ID NO:181, SEQ ID 
NO: 182, SEQ ID NO: 183, SEQ ID NO: 184, SEQ ID NO: 185, SEQ ID NO: 186, SEQ ID 
NO: 187, SEQ ID NO: 188, SEQ ID NO: 189, SEQ ID NO: 190, SEQ ID NO: 191, SEQ ID 
NO: 199, SEQ ID NO: 193, SEQ ID NO: 194, SEQ ID NO: 195, SEQ ID NO: 196, SEQ ID 
NO: 197, SEQ ID NO: 198, SEQ ID NO: 199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID 
NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID 
NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:21 1, SEQ ID 
NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID 
NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID 
NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID 
NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID 
NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID 
NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241 , and SEQ 
ID NO:242; (b) is the complement of the nucleotide sequence of (a); (c) hybridizes under 
highly stringent conditions to the nucleotide molecule of (a) and encodes a naturally 
occurring kinase polypeptide; (d) encodes a kinase polypeptide having an amino acid 
sequence selected from the group consisting of those set forth in SEQ ID NO: 122, SEQ ID 
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NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ED NO:201, SEQ ID NO:202, SEQ ID 
NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID 
NO:208, SEQ ID NO.209, SEQ ID NO:210, SEQ ID N0:211, SEQ ID NO:212, SEQ ID 
NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID 
5 N0:2 1 8, SEQ ID N0:2 1 9, SEQ ID NO:220, SEQ ID NO:22 1 , SEQ ID NO:222, SEQ ID 
NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID 
NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID 
NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID 
NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241 , and SEQ ID NO:242, or the 
1 0 corresponding full-length amino acid sequence, or fragments thereof. A sequence that is 

substantially similar to a sequence selected from the group consisting of those set forth in 
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SEQ ID NO:227, SEQ ED NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, 
SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, 
SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241 , 
and SEQ ID NO:242 will have at least 75% identity (preferably 90%, more preferably at 

5 least 95% and most preferably 99-100%) to the sequence of SEQ ID NO:122, SEQ ID 

NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID NO:127, SEQ ID 
NO: 128, SEQ ID NO:129, SEQ ID NO: 130, SEQ ID N0.131, SEQ ID NO: 132, SEQ ID 
NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID NO:137, SEQ ID 
NO: 138, SEQ ID NO:139,_SEQ ID NO: 140, SEQ ID NO: 141, SEQ ID NO: 142, SEQ ED 

10 NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ED NO:146, SEQ ED NO:147, SEQ ID 

NO: 1 48, SEQ ID NO: 1 49, SEQ ED NO: 1 50, SEQ ID NO: 1 5 1 , SEQ ID NO: 1 52, SEQ ID 
NO:153, SEQ ID NO: 154, SEQ ID NO: 155, SEQ ID NO: 156, SEQ ID NO: 157, SEQ ID 
NO:158, SEQ ID NO:159, SEQ ED NO: 160, SEQ ID NO:161, SEQ ID NO:162, SEQ ID 
NO: 1 63, SEQ ID NO: 1 64, SEQ ID NO: 1 65 . SEQ ID NO: 1 66, SEQ ID NO: 1 67, SEQ ID 

15 NO: 1 68, SEQ ID NO: 1 69, SEQ ID NO: 1 70, SEQ ED NO: 1 7 1 , SEQ ED NO: 1 72, SEQ ID 

NO: 1 73, SEQ ED NO: 1 74, SEQ ID NO: 1 75, SEQ ID NO: 1 76, SEQ ID NO: 1 77, SEQ ID 
NO: 178, SEQ ID NO: 179, SEQ ED NO: 180, SEQ ID NO: 181, SEQ ID NO: 182, SEQ ID 
NO: 1 83, SEQ ID NO: 1 84, SEQ ID NO: 1 85, SEQ ID NO: 1 86, SEQ ID NO: 1 87, SEQ ID 
NO: 1 88, SEQ ED NO: 1 89, SEQ ID NO: 190, SEQ ID NO: 1 91 , SEQ ID NO: 1 99, SEQ ID 

20 NO: 193, SEQ ED NO: 194, SEQ ED NO: 195, SEQ ED NO: 196, SEQ ED NO: 197, SEQ ED 
NO: 198, SEQ ED NO: 199, SEQ ED NO:200, SEQ ED NO:201, SEQ ED NO:202, SEQ ID 
NO:203, SEQ ID NO:204, SEQ ED NO:205, SEQ ED NO:206, SEQ ED NO:207, SEQ ID 
NO:208, SEQ ID NO:209, SEQ ED NO:210, SEQ ED NO:21 1, SEQ ED NO:212, SEQ ID 
NO:213, SEQ ED NO:214, SEQ ED NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID 

25 NO:218, SEQ ED NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID 
NO:223, SEQ ED NO:224, SEQ ED NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID 
NO:228, SEQ ED NO:229, SEQ ED NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID 
NO:233, SEQ ID NO:234, SEQ ED NO:235, SEQ ID NO:236, SEQ ED NO:237, SEQ ID 
NO:238, SEQ ED NO:239, SEQ ED NO:240, SEQ ID NO:241, and SEQ ID NO:242, 

30 except that it lacks one or more of the domains selected from the group consisting of a N- 
terminal domain, a catalytic domain, a C-terminal domain, a coiled-coil structure region, a 
proline-rich region, a spacer region, an insert, and a C-terminal tail; or (i) is the 
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complement of the nucleotide sequence of (h). The domain demarcations of the 
polypeptides of the invention are indicated in Table 2 by reference to the kinase domain. 

The term "complement" refers to two nucleotides that can form multiple favorable 
interactions with one another. For example, adenine is complementary to thymine as they 

5 can form two hydrogen bonds. Similarly, guanine and cytosine are complementary since 

they can form three hydrogen bonds. A nucleotide sequence is the complement of another 
nucleotide sequence if all of the nucleotides of the first sequence are complementary to all 
of the nucleotides of the second sequence. 

The term "domain" refers to a region of a polypeptide that contains a particular 

10 function. For instance, N-terminal or C-terminal domains of signal transduction proteins 

can serve functions including, but not limited to, binding molecules that localize the signal 
transduction molecule to different regions of the cell or binding other signaling molecules 
directly responsible for propagating a particular cellular signal. Some domains can be 
expressed separately from the rest of the protein and function by themselves, while others 

15 must remain part of the intact protein to retain function. The latter are termed functional 

regions of proteins and also relate to domains. 

The term "N-terminal domain" refers to the extracatalytic region located between 
the initiator methionine and the catalytic domain of the protein kinase. The N-terminal 
domain can be identified following a Smith-Waterman alignment of the protein sequence 

20 against the non-redundant protein database to define the N-terminal boundary of the 

catalytic domain. Depending on its length, the N-terminal domain may or may not play a 
regulatory role in kinase function. An example of a protein kinase whose N-terminal 
domain has been shown to play a regulatory role is PAK65, which contains a CRIB motif 
used for Cdc42 and rac binding (Burbelo, P.D. et al (1995) J. Biol. Chem. 270, 29071- 

25 29074). The N-terminal domain of a protein kinase of the invention is that portion of the 

protein kinase to the amino-terminal side of the kinase domain where the kinase domain is 
identified in Table 2, herein. Further, in some cases, portions of the N-terminal domains 
of the protein kinases of the invention have not been identified since the entire sequence is 
not available. However, with the methods described herein, the full-length sequences of 

30 the kinases of the invention can be determined and using the approaches described herein 

the N-terminal domain can be identified. 
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The term "catalytic domain" or "kinase domain ' refers to a region of the protein 
kinase that is typically 25-300 amino acids long and is responsible for carrying out the 
phosphate transfer reaction from a high-energy phosphate donor molecule such as ATP or 
GTP to itself (autophosphorylation) or to other proteins (exogenous phosphorylation). 
The catalytic domain of protein kinases is made up of 12 subdomains that contain highly 
conserved amino acid residues, and are responsible for proper polypeptide folding and for 
catalysis. The catalytic domain can be identified following a Smith-Waterman alignment 
of the protein sequence against the non-redundant protein database. The catalytic/kinase 
domains of the protein kinases of the invention are identified in Table 2, herein. Further, 
in some cases, the complete sequence of the catalytic/kinase domains of the protein 
kinases of the invention may not have been provided since the entire sequence is not 
available- However, with the methods described herein, the full-length sequences of the 
kinases of the invention can be determined, and using the approaches described herein, the 
catalytic/kinase domain can be identified. 

The term "catalytic activity", as used herein, defines the rate at which a kinase 
catalytic domain phosphorylates a substrate. Catalytic activity can be measured, for 
example, by determining the amount of a substrate converted to a phosphorylated product 
as a function of time. Catalytic activity can be measured by methods of the invention by 
holding time constant and determining the concentration of a phosphorylated substrate 
after a fixed period of time. Phosphorylation of a substrate occurs at the active-site of a 
protein kinase. The active-site is normally a cavity in which the substrate binds to the 
protein kinase and is phosphorylated. 

The term "substrate" as used herein refers to a molecule phosphorylated by a 
kinase of the invention. Kinases phosphorylate substrates on serine/threonine or tyrosine 
amino acids. The molecule may be another protein or a polypeptide. 

The term "C-terminal domain" refers to the region located between the catalytic 
domain and the carboxy-terminal amino acid residue of the protein kinase. The C- 
terminal domain can be identified by using a Smith-Waterman alignment of the protein 
sequence against the non-redundant protein database to define the C-terminal boundary of 
the catalytic domain or of any flinctional C-terminal extracatalytic domain. Depending on 
its length and amino acid composition, the C-terminal domain may or may not play a 
regulatory role in kinase function. An example of a protein kinase whose C-terminal 
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domain may play a regulatory role is PAK3 which contains a heterotrimeric G b subunit- 
binding site near its C-terminus (Leeuw, T. et ah (1998) Nature, 391, 191-195). The C- 
terminal domain of a protein kinase of the invention is that portion of the protein kinase to 
the carboxy-terminal side of the kinase domain where the kinase domain is identified in 
5 Table 2, herein. In some cases, the C-terminal domains of the protein kinases of the 

invention have not been provided since the entire sequence is not available. However, 
with the methods described herein, the full-length sequences of the kinases of the 
invention can be determined, and using the approaches described herein, the C-terminal 
domain can be identified. 

10 The term "signal transduction pathway" refers to the molecules that propagate an 

extracellular signal through the cell membrane to become an intracellular signal. This 
signal can then stimulate a cellular response. The polypeptide molecules involved in 
signal transduction processes are typically receptor and non-receptor protein tyrosine 
kinases, receptor and non-receptor protein phosphatases, SRC homology 2 and 3 domains, 

] 5 phosphotyrosine binding proteins (SRC homology 2 (SH2) and phosphotyrosine binding 

(PTB and PH) domain containing proteins), proline-rich binding proteins (SH3 domain 
containing proteins), nucleotide exchange factors, and transcription factors. 

The term "coiled-coil structure region" as used herein, refers to a polypeptide 
sequence that has a high probability of adopting a coiled-coil structure as predicted by 

20 computer algorithms such as COILS (Lupas, A. (1 996) Meth. Enzymology 266:5 1 3-525). 

Coiled-coils are fonned by two or three amphipathic a-helices in parallel. Coiled-coils can 
bind to coiled-coil domains of other polypeptides resulting in homo- or heterodimers 
(Lupas, A. (1991) Science 252:1 162-1 164). Coiled-coil-dependent oligomerization has 
been shown to be necessary for protein function including catalytic activity of 

25 serine/threonine kinases (Roe, J. et al (1997) J. Biol. Chem. 272:5838-5845). Coiled-coil 

regions in the proteins of the invention can be identified using these methods. They may 
be present as sub-domains of the N-terminal, kinase, or C-terminal domains of the 
polypeptides of the invention. 

The term "proline-rich region" as used herein, refers to a region of a protein kinase 

30 whose proline content over a given amino acid length is higher than the average content of 

this amino acid found in proteins (i.e., >10%). Proline-rich regions are easily discernable 
by visual inspection of amino acid sequences and quantitated by standard computer 



WO 00/73469 



29 



PCT/US00/14842 



sequence analysis programs such as the DNAStar program EditSeq. Proline-rich regions 
have been demonstrated to participate in regulatory protein -protein interactions. Among 
these interactions, those that are most relevant to this invention involve the "PxxP" proline 
rich motif found in certain protein kinases {i.e., human PAK1) and the SH3 domain of the 

5 adaptor molecule Nek (Galisteo, M.L. et al (1996) J. Biol. Chem. 271:20997-21000). 
Other regulatory interactions involving "PxxP" proline-rich motifs include the WW 
domain (Sudol, M. (1996) Prog. Biophys. Mol. Bio. 65:1 13-132). Proline rich regions in 
the proteins of the invention can be identified using these methods. They may be present 
-as subrdomains of the N-terminal, kinase, or C-terminal domains of the polypeptides of 

10 the invention. 

The term "spacer region" as used herein, refers to a region of the protein kinase 
located between predicted functional domains. The spacer region has no detectable 
homology to any amino acid sequence in the database, and can be identified by using a 
Smith-Waterman alignment of the protein sequence against the non-redundant protein 

15 database to define the C- and N-terminal boundaries of the flanking functional domains. 

Spacer regions may or may not play a fundamental role in protein kinase function. 
Precedence for the regulatory role of spacer regions in kinase function is provided by the 
role of the sre kinase spacer in inter-domain interactions (Xu, W. et al. (1997) Nature 
385:595-602). Spacer regions in the proteins of the invention can be identified using these 

20 methods. They may be present as sub-domains of the N-terminal, kinase, or C-terminal 

domains of the polypeptides of the invention. 

The term "insert" as used herein refers to a portion of a protein kinase that is 
absent from a close homolog. Inserts may or may not by the product alternative splicing 
of exons. Inserts can be identified by using a Smith-Waterman sequence alignment of the 

25 protein sequence against the non-redundant protein database, or by means of a multiple 
sequence alignment of homologous sequences using the DNAStar program Megalign. 
Inserts may play a functional role by presenting a new interface for protein-protein 
interactions, or by interfering with such interactions. Insert regions in the proteins of the 
invention can be identified using these methods. They may be present as sub-domains of 

30 the N-terminal, kinase, or C-terminal domains of the polypeptides of the invention. 
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The term "C-terminal tail" as used herein, refers to a C-terminal domain of a 
protein kinase, that by homology extends or protrudes past the C-terminal amino acid of 
its closest homolog. C-teiminal tails can be identified by using a Smith-Waterman 
sequence alignment of the protein sequence against the non-redundant protein database, or 
5 by means of a multiple sequence alignment of homologous sequences using the DNAStar 

program Megalign. Depending on its length, a C-terminal tail may or may not play a 
regulatory role in kinase function. C-terminal tail regions in the proteins of the invention 
can be identified using these methods. They may be present as sub-domains of the N- 
terminal, kinase, or C-terminal domains of the polypeptides of the invention. 

] o Various low or high stringency hybridization conditions may be used depending 

upon the specificity and selectivity desired. These conditions are well-known to those 
skilled in the art. Under stringent hybridization conditions only highly complementary 
nucleic acid sequences hybridize. Preferably, such conditions prevent hybridization of 
nucleic acids having more than 1 or 2 mismatches out of 20 contiguous nucleotides, more 

15 preferably, such conditions prevent hybridization of nucleic acids having more than 1 or 2 

mismatches out of 50 contiguous nucleotides, most preferably, such conditions prevent 
hybridization of nucleic acids having more than 1 or 2 mismatches out of 1 00 contiguous 
nucleotides. In some instances, the conditions may prevent hybridization of nucleic acids 
having more than 5 mismatches in the full-length sequence. 

20 By stringent hybridization assay conditions is meant hybridization assay conditions 

at least as stringent as the following: hybridization in 50% formamide, 5X SSC, 50 mM 
NaH 2 P0 4 , pH 6.8, 0.5% SDS, 0.1 mg/mL sonicated salmon sperm DNA, and 5X Denhart 
solution at 42 °C overnight; washing with 2X SSC, 0.1% SDS at 45 °C; and washing with 
0.2X SSC, 0.1% SDS at 45 °C. Under some of the most stringent hybridization assay 

25 conditions, the second wash can be done with 0.1X SSC at a temperature up to 70 °C (pg. 

421, Berger etal (1987) Guide to Molecular Cloning Techniques , Meth. Enzym. vol. 152, 
hereby incorporated by reference herein including any figures, tables, or drawings.). 
However, other applications may require the use of conditions falling between these sets 
of conditions. Methods of determining the conditions required to achieve desired 

30 hybridizations are well-known to those with ordinary skill in the art, and are based on 
several factors, including but not limited to, the sequences to be hybridized and the 
samples to be tested. 
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In other preferred embodiments, the invention features isolated, enriched, or 
purified nucleic acid molecules encoding kinase polypeptides, further comprising a vector 
or promoter effective to initiate transcription in a host cell. The invention also features 
recombinant nucleic acid, preferably in a cell or an organism. The recombinant nucleic 
acid may contain a sequence selected from the group consisting of those set forth in SEQ 
ID NO.T, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, 
SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO: 10, SEQ ID NO:l 1, SEQ ID 
NO: 12, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, 
SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ 
ID NO:23, SEQ ID NO:24, SEQ ID NO:25, SEQ ID NO:26, SEQ ID NO:27, SEQ ID 
NO:28, SEQ ID NO:29, SEQ ID NO:30, SEQ ID NO:31 , SEQ ID NO:32, SEQ ID NO:33, 
SEQ ID NO:34, SEQ ID NO:35, SEQ ID NO:36, SEQ ID NO:37, SEQ ID NO:38, SEQ 
ID NO:39, SEQ ID NO:40, SEQ ID NO:41 , SEQ ID NO:42, SEQ ID NO:43, SEQ ID 
NO:44, SEQ ID NO:45, SEQ ID NO:46, SEQ ID NO:47, SEQ ID NO:48, SEQ ID NO:49, 
SEQ ID NO:50, SEQ ID NO:51, SEQ ID NO:52, SEQ ID NO:53, SEQ ID NO:54, SEQ 
ID NO:55, SEQ ID NO:56, SEQ ID NO:57, SEQ ID NO:58, SEQ ID NO:59, SEQ ID 
NO:60, SEQ ID NO:61, SEQ ID NO:62, SEQ ID NO:63, SEQ ID NO:64, SEQ ID NO:65, 
SEQ ID NO:66, SEQ ID NO:67, SEQ ID NO:68, SEQ ID NO:69, SEQ ID NO:70, SEQ 
ID NO:71 , SEQ ID NO:72, SEQ ID NO:73, SEQ ID NO:74, SEQ ID NO:75, SEQ ID 
NO:76, SEQ ID NO:77, SEQ ID NO:78, SEQ ID NO:79, SEQ ID NO:80, SEQ ID NO:81 , 
SEQ ID NO:82, SEQ ID NO:83, SEQ ID NO:84, SEQ ID NO:85, SEQ ID NO:86, SEQ 
ID NO:87, SEQ ID NO:88, SEQ ID NO:89, SEQ ED NO:90, SEQ ID NO:91, SEQ ID 
NO:92, SEQ ID NO:93, SEQ ID NO:94, SEQ ID NO:95, SEQ ID NO:96, SEQ ID NO:97, 
SEQ ID NO:98, SEQ ID NO:99, SEQ ID NO:100, SEQ ID NO:101, SEQ ID NO:102, 
SEQ ID NO: 103, SEQ ID NO: 104, SEQ ID NO: 105, SEQ ID NO: 106, SEQ ID NO: 107, 
SEQ ID NO: 1 08, SEQ ID NO: 1 09, SEQ ID NO: 1 1 0, SEQ ID NO: 1 1 1 , SEQ ID NO: 1 1 2, 
SEQ ID NO: 11 3, SEQ ID NO: 1 1 4, SEQ ID NO: 1 1 5, SEQ ID NO: 1 1 6, SEQ ID NO: 1 1 7, 
SEQ ID NO: 1 1 8, SEQ ID NO: 1 1 9, SEQ ID NO: 1 20, and SEQ ID NO: 1 21 , or a functional 
derivative thereof and a vector or a promoter effective to initiate transcription in a host 
cell. The recombinant nucleic acid can alternatively contain a transcriptional initiation 
region functional in a cell, a sequence complementary to an RNA sequence encoding a 
kinase polypeptide and a transcriptional termination region functional in a cell. Specific 
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vectors and host cell combinations are discussed herein. The recombinant nucleic acid can 
also contain the full-length sequence encoding the protein kinase, or a domain, for 
example. 

The term "vector" relates to a single or double-stranded circular nucleic acid 
5 molecule that can be transfected into cells and replicated within or independently of a cell 

genome. A circular double-stranded nucleic acid molecule can be cut and thereby 
linearized upon treatment with restriction enzymes. An assortment of nucleic acid vectors, 
restriction enzymes, and the knowledge of the nucleotide sequences cut by restriction 
enzymes are readily available to those skilled in the art. A nucleic acid molecule encoding 

1 0 a kinase can be inserted into a vector by cutting the vector with restriction enzymes and 

ligating the two pieces together. 

The term "transfecting" defines a number of methods to insert a nucleic acid vector 
or other nucleic acid molecules into a cellular organism. These methods involve a variety 
of techniques, such as treating the cells with high concentrations of salt, an electric field, 

] 5 detergent, or DMSO to render the outer membrane or wall of the cells permeable to 

nucleic acid molecules of interest or use of various viral transduction strategies. 

The term "promoter" as used herein, refers to nucleic acid sequence needed for 
gene sequence expression. Promoter regions vary from organism to organism, but are well 
known to persons skilled in the art for different organisms. For example, in prokaryotes, 

20 the promoter region contains both the promoter (which directs the initiation of RNA 

transcription) as well as the DNA sequences which, when transcribed into RNA, will 
signal synthesis initiation. Such regions will normally include those 5'-non-coding 
sequences involved with initiation of transcription and translation, such as the TATA box, 
capping sequence, CAAT sequence, and the like. 

25 In preferred embodiments, the isolated nucleic acid comprises, consists essentially 

of, or consists of a nucleic acid sequence set forth in SEQ ED NO:l, SEQ ID NO:2, SEQ 
ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, 
SEQ ID NO:9, SEQ ID NO: 10, SEQ ID NO:l 1, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID 
NO: 14, SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, 

30 SEQ ID NO:20, SEQ ID NO:2 1 , SEQ ID NO:22, SEQ ID NO:23, SEQ ED NO:24, SEQ 
ID NO:25, SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:28, SEQ ID NO:29, SEQ ID 
NO:30, SEQ ID NO:31, SEQ ID NO:32, SEQ ID NO:33, SEQ ID NO:34, SEQ ID NO:35, 
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SEQ ID NO:36, SEQ ID NO:37, SEQ ID NO:38, SEQ ID NO:39, SEQ ID NO:40, SEQ 
ID NO:41, SEQ ID NO:42, SEQ ID NO:43, SEQ ID NO:44, SEQ ID NO:45, SEQ ID 
NO:46, SEQ ED NO:47, SEQ ID NO:48, SEQ ID NO:49, SEQ ID NO:50, SEQ ID N0:51 , 
SEQ ID NO:52, SEQ ID NO:53, SEQ ID NO:54, SEQ ID NO:55, SEQ ID NO:56, SEQ 

5 IDNO:57, SEQ DD NO:58, SEQ ED NO:59, SEQ ED NO:60, SEQ ED N0:61, SEQ ID 

NO:62, SEQ ID NO:63, SEQ ID NO:64, SEQ ID NO:65, SEQ ID NO:66, SEQ ID NO:67, 
SEQ ID NO:68, SEQ ID NO:69, SEQ ID NO:70, SEQ ID N0:71 , SEQ ID NO:72, SEQ 
ID NO:73, SEQ ID NO:74, SEQ ID NO:75, SEQ ID NO:76, SEQ ID NO:77, SEQ ID 
NO:78, SEQ ID NO:79, SEQ ID NO:80, SEQ ID N0:81 , SEQ ID NO:82, SEQ ED NO:83, 

10 SEQ ID NO:84, SEQ ID NO:85, SEQ ID NO:86, SEQ ID NO:87, SEQ ID NO:88, SEQ 

ID NO:89, SEQ ID NO:90, SEQ ID N0:91, SEQ ID NO:92, SEQ ID NO:93, SEQ ID 
NO:94, SEQ ID NO:95, SEQ ID NO:96, SEQ ID NO:97, SEQ ID NO:98, SEQ ID NO:99, 
SEQ ID NO: 1 00, SEQ ID NO: 1 01 , SEQ ID NO: 1 02, SEQ ID NO: 1 03, SEQ ID NO: 1 04, 
SEQ ID NO: 105, SEQ ID NO: 106, SEQ ID NO: 107, SEQ ID NO: 108, SEQ ID NO: 109, 

15 SEQ ID NO: 1 1 0, SEQ ID NO: 1 1 1 , SEQ ID NO: 1 1 2, SEQ ID NO: 1 1 3, SEQ ID NO: 1 1 4, 

SEQ ID NO: 1 1 5, SEQ ID NO: 1 1 6, SEQ ID NO: 1 1 7, SEQ ID NO: 1 1 8, SEQ ID NO: 1 1 9, 
SEQ ID NO:120, and SEQ ID NO:121, or the corresponding full-length sequence, encodes 
an amino acid sequence selected from the group consisting of those set forth in SEQ ID 
NO: 122, SEQ ID NO: 123, SEQ ID NO: 124, SEQ ID NO: 125, SEQ ID NO: 126, SEQ ID 

20 NO: 127, SEQ ID NO: 128, SEQ ID NO: 129, SEQ ED NO: 130, SEQ ED NO: 131, SEQ ID 
NO: 132, SEQ ID NO: 133, SEQ ED NO: 134, SEQ ED NO: 135, SEQ ED NO: 136, SEQ ID 
NO: 137, SEQ ID NO: 138, SEQ ED NO: 139, SEQ ID NO: 140, SEQ ED NO: 141, SEQ ID 
NO: 142, SEQ ED NO: 143, SEQ ID NO: 144, SEQ ID NO: 145, SEQ ID NO: 146, SEQ ID 
NO:147, SEQ IDNO:148, SEQ ID NO:149, SEQ ED NO:150, SEQ ID NO:151, SEQ ID 

25 NO: 152, SEQ ID NO: 153, SEQ ID NO: 154, SEQ ID NO: 155, SEQ ID NO: 156, SEQ ED 
NO: 1 57, SEQ ID NO: 1 58, SEQ ID NO: 1 59, SEQ ID NO: 1 60, SEQ ED NO: 1 6 1 , SEQ ID 
NO: 162, SEQ ED NO: 163, SEQ DD NO: 164, SEQ ED NO: 165. SEQ ED NO: 166, SEQ ID 
NO: 167, SEQ ID NO: 168, SEQ ED NO: 169, SEQ ID NO: 170, SEQ ID NO: 171, SEQ ID 
NO: 172, SEQ ID NO: 173, SEQ ED NO: 174, SEQ ED NO: 175, SEQ ID NO: 176, SEQ ID 

30 NO: 1 77, SEQ ED NO: 1 78, SEQ ED NO:l 79, SEQ ED NO:l 80, SEQ ID NO: 1 8 1 , SEQ ID 
NO: 182, SEQ ED NO: 183, SEQ ED NO: 184, SEQ ED NO: 185, SEQ ID NO:186, SEQ ID 
NO: 1 87, SEQ ED NO: 1 88, SEQ ED NO: 1 89, SEQ ED NO: 1 90, SEQ DD NO: 1 9 1 , SEQ ED 
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NO: 199, SEQ ID NO: 193, SEQ ID NO: 194, SEQ ID NO: 195, SEQ ID NO: 196, SEQ ID 
NO: 1 97, SEQ ID NO: 1 98, SEQ ID NO: 1 99, SEQ ID NO:200, SEQ ID NO:20 1 , SEQ ID 
NO:202, SEQ ID NO.203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID 
NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:21 1, SEQ ID 
5 NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID 

NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID 
NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ED NO:225, SEQ ID NO:226, SEQ ID 
NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID N0.231, SEQ ID 
N0:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID 
1 0 NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241 , and SEQ 
ID NO:242, or the corresponding full-length amino acid sequence, a functional derivative 
thereof, or at least 10, 20, 40, 50, 75, 100, 200, 300 or 500 contiguous amino acids of a 
sequence selected from the group consisting of those set forth in SEQ ID NO: 122, SEQ ID 
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NO:21 8, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID 
NO:223 5 SEQ ID NO:224, SEQ ID NO:225 5 SEQ ID NO:226, SEQ ID NO:227, SEQ ID 
NO:228 s SEQ ID NO:229, SEQ ID NO:230, SEQ ED NO:231 , SEQ ID NO:232 5 SEQ ID 
NO:233, SEQ ID NO:234, SEQ ED NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID 

5 NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241 , and SEQ ID NO:242, or the 
corresponding full-length sequences or derivatives thereof. The nucleic acid may be 
isolated from a natural source by cDNA cloning or by subtractive hybridization. The 
natural source may be mammalian, preferably human, blood, semen, or tissue, and the 
nucleic acid may be synthesized by the triester method or by using an automated DNA 

10 synthesizer. 

The term "mammal" refers preferably to such organisms as mice, rats, rabbits, 
guinea pigs, sheep, and goats, more preferably to cats, dogs, monkeys, and apes, and most 
preferably to humans. 

In yet other preferred embodiments, the nucleic acid is a conserved or unique 

1 5 region, for example those useful for: the design of hybridization probes to facilitate 

identification and cloning of additional polypeptides, the design of PCR probes to 
facilitate cloning of additional polypeptides, obtaining antibodies to polypeptide regions, 
and designing antisense oligonucleotides. 

By "conserved nucleic acid regions", are meant regions present on two or more 

20 nucleic acids encoding a kinase polypeptide, to which a particular nucleic acid sequence 

can hybridize under lower stringency conditions. Examples of lower stringency 
conditions suitable for screening for nucleic acid encoding kinase polypeptides are 
provided in Berger et al (1 987) Guide to Molecular Cloning Techniques , Meth. Enzym. 
vol. 152, hereby incorporated by reference herein in its entirety, including any drawings, 

25 figures, or tables. Preferably, conserved regions differ by no more than 5 out of 20 

nucleotides, even more preferably 2 out of 20 nucleotides or most preferably 1 out of 20 
nucleotides. 

By "unique nucleic acid region" is meant a sequence present in a nucleic acid 
coding for a kinase polypeptide that is not present in a sequence coding for any other 
30 naturally occurring polypeptide. Such regions preferably encode 10 (preferably 25, more 
preferably 50, most preferably 75) or more contiguous amino acids selected from the 
group consisting of those set forth in SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, 
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SEQ ID NO:125, SEQ ID NO:126, SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, 
SEQ ID NO: 130, SEQ ID NO:131, SEQ ID NO: 132, SEQ ID NO: 133, SEQ ID NO: 134, 
SEQ ID NO: 135, SEQ ID NO: 136, SEQ ID NO: 137, SEQ ID NO: 138, SEQ ID NO: 139, 
SEQ ID NO: 140, SEQ ID NO: 141, SEQ ID NO: 142, SEQ ID NO: 143, SEQ ID NO: 144, 
5 SEQ ID NO: 145, SEQ ID NO: 146, SEQ ID NO: 147, SEQ ID NO: 148, SEQ ID NO: 149, 
SEQ ID NO: 150, SEQ ID NO:l 51, SEQ ID NO: 152, SEQ ID NO: 153, SEQ ID NO: 154, 
SEQ ID NO: 1 55, SEQ ID NO: 1 56, SEQ ID NO:l 57, SEQ ID NO: 1 58, SEQ ID NO: 1 59, 
SEQ ID NO: 160, SEQ ID NO: 161, SEQ ID NO: 162, SEQ ID NO: 163, SEQ ID NO: 164, 
SEQ ED NO: 1 65 . SEQ ID NO: 1 66, SEQ ID NO: 1 67, SEQ ID NO: 1 68, SEQ ID NO: 1 69, 

10 SEQ ID NO: 170, SEQ ED NO: 171, SEQ ED NO: 172, SEQ ID NO: 173, SEQ ID NO: 174, 

SEQ ED NO: 1 75, SEQ ID NO: 1 76, SEQ ID NO: 1 77, SEQ ID NO: 1 78, SEQ ID NO: 1 79, 
SEQ ED NO: 1 80, SEQ ED NO: 1 81 , SEQ ID NO: 1 82, SEQ ID NO: 1 83, SEQ ID NO: 1 84, 
SEQ ED NO: 1 85, SEQ ID NO: 1 86, SEQ ID NO: 1 87, SEQ ID NO: 1 88, SEQ ID NO: 1 89, 
SEQ ED NO: 1 90, SEQ ED NO: 1 91 , SEQ ED NO: 1 99, SEQ ID NO: 1 93, SEQ ED NO: 1 94, 

15 SEQ ED NO: 195, SEQ ED NO:196, SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO: 199, 
SEQ ED NO:200, SEQ ID NO:201 , SEQ ED NO:202, SEQ ID NO:203, SEQ ID NO:204, 
SEQ ED NO:205, SEQ ID NO:206, SEQ ED NO:207, SEQ ED NO:208, SEQ ID NO:209, 
SEQ ED NO:210, SEQ ED NO:21 1, SEQ ED NO:212, SEQ ED NO:213, SEQ ID NO:214, 
SEQ ED NO:215, SEQ ED NO:216, SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, 

20 SEQ ED NO:220, SEQ ED NO:221, SEQ ED NO:222, SEQ ED NO:223, SEQ ED NO:224, 
SEQ ID NO:225, SEQ ED NO:226, SEQ ED NO:227, SEQ ED NO:228, SEQ ED NO:229, 
SEQ ED NO:230, SEQ ID NO:231, SEQ ED NO:232, SEQ ID NO:233, SEQ ED NO:234, 
SEQ ED NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ED NO:238, SEQ ED NO:239, 
SEQ ED NO:240, SEQ ED NO:241, and SEQ ED NO:242, or functional derivatives thereof. 

25 In particular, a unique nucleic acid region is preferably of mammalian origin and 
preferably human. 

A second aspect of the invention features a nucleic acid probe for the detection of 
nucleic acid encoding a kinase polypeptide in a sample, wherein said polypeptide is 
selected from the group consisting of SEQ ED NO:122, SEQ ED NO:123, SEQ ID NO:124, 
30 SEQ ED NO: 125, SEQ ED NO: 126, SEQ ED NO: 127, SEQ ED NO: 128, SEQ ID NO: 129, 
SEQ ED NO:130, SEQ ED NO:131, SEQ ED NO:132, SEQ ED NO:133, SEQ ED NO:134, 
SEQ ID NO:135, SEQ EDNO:136, SEQ ED NO:137, SEQ ED NO:138, SEQ ID NO: 139, 
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SEQ ID NO:140, SEQ ID NO:141, SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, 
SEQ ID NO:145, SEQ ID NO:146, SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, 
SEQ ID NO: 1 50, SEQ ID NO: 1 5 1 , SEQ ID NO: 1 52, SEQ ID NO: 1 53, SEQ ID NO: 1 54, 
SEQ ID NO:155, SEQ ID NO:156, SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, 
SEQ ID NO:160, SEQ ID NO:161, SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, 
SEQ ID NO: 165. SEQ ED NO: 166, SEQ ID NO: 167, SEQ ID NO: 168, SEQ ID NO: 169, 
SEQ ID NO: 170, SEQ ID NO:171, SEQ ID N0.172, SEQ ID NO: 173, SEQ ID NO: 174, 
SEQ ID NO:175, SEQ ID NO:176, SEQ ID NO:177, SEQ ID N0.178, SEQ ID NO:179, 
SEQ ID NO: 180, SEQ ID NO: 1 81 , SEQ ID NO:l 82, SEQ ID NO:l 83, SEQ ID NO:l 84, 
SEQ ID NO: 185, SEQ ID NO: 186, SEQ ID NO:187, SEQ ID NO: 188, SEQ ID NO:189, 
SEQ ID NO:190, SEQ ID NO:191, SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, 
SEQ ID NO:195, SEQ ID NO:196, SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, 
SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, 
SEQ ID NO:205, SEQ ID NO:206, SEQ ED NO:207, SEQ ID NO:208, SEQ ID NO:209, 
SEQ ID NO:210, SEQ ID NO:21 1, SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, 
SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, 
SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, 
SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, 
SEQ ED NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, 
SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, 
SEQ ID NO:240, SEQ ID NO:241, and SEQ ID NO:242. Preferably, the nucleic acid 
probe encodes a kinase polypeptide that is a fragment of the protein encoded by an amino 
acid sequence selected from the group consisting of those set forth in SEQ ED NO:122, 
SEQ ID NO:123, SEQ ID NO:124, SEQ ED NO:125, SEQ ID NO:126, SEQ ED NO:127, 
SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID NO:132, 
SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ED NO:136, SEQ ED NO:137, 
SEQ ID NO: 138, SEQ ID NO: 139, SEQ ID NO: 140, SEQ ID NO: 141, SEQ ID NO: 142, 
SEQ ID NO:143, SEQ ID NO:144, SEQ ED NO:145, SEQ ED NO:146, SEQ ID NO:147, 
SEQ ID NO: 1 48, SEQ ID NO: 1 49, SEQ ID NO: 1 50, SEQ ID NO: 1 5 1 , SEQ ID NO: 1 52, 
SEQ ID NO: 153, SEQ ID NO: 154, SEQ ED NO: 155, SEQ ID NO: 156, SEQ ID NO: 157, 
SEQ ID NO: 158, SEQ ED NO: 159, SEQ ID NO: 160, SEQ ID NO: 161, SEQ ID NO: 162, 
SEQ ID NO: 163, SEQ ID NO: 164, SEQ ID NO: 165. SEQ ID NO: 166, SEQ ID NO: 167, 
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SEQ ID NO: 1 68, SEQ ID NO: 1 69, SEQ ID NO: 1 70, SEQ ID NO: 1 7 1 , SEQ ID NO: 1 72, 
SEQ ID NO: 173, SEQ ID NO: 174, SEQ ID NO: 175, SEQ ID NO: 176, SEQ ID NO: 177, 
SEQ ID NO: 1 78, SEQ ID NO: 1 79, SEQ ID NO: 1 80, SEQ ID NO: 1 8 1 , SEQ ID NO: 1 82, 
SEQ ID NO: 183, SEQIDNO:184, SEQ ED NO: 185, SEQ ID NO: 186, SEQ ID NO:187, 

5 SEQ ID NO: 1 88, SEQ ID NO: 1 89, SEQ ID NO: 1 90, SEQ ID NO: 1 9 1 , SEQ ID NO: 1 99, 

SEQ ID NO: 1 93, SEQ ID NO: 1 94, SEQ ID NO: 1 95, SEQ ID NO: 1 96, SEQ ED NO: 1 97, 
SEQ ID NO: 198, SEQ ID NO: 199, SEQ ID NO:200, SEQ ED NO:201, SEQ ID NO.202, 
SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207, 
SEQ ED_NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:21 1, SEQ ID NO:212, 

10 SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217, 

SEQ ED NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, 
SEQ ED NO:223, SEQ ED NO:224, SEQ ED NO:225, SEQ ID NO:226, SEQ ID NO:227, 
SEQ ED NO:228, SEQ ED NO:229, SEQ ED NO:230, SEQ ID NO:23 1 , SEQ ID NO:232, 
SEQ ED NO:233, SEQ ED NO:234, SEQ ED NO:235, SEQ ID NO:236, SEQ ID NO:237, 

15 SEQ ED NO:238, SEQ ED NO:239, SEQ ED NO:240, SEQ ED NO:241, and SEQ ED 

NO:242, or the corresponding full-length amino acid sequences. The nucleic acid probe 
contains a nucleotide base sequence that will hybridize to a sequence selected from the 
group consisting of those set forth in SEQ ED NO:l, SEQ ED NO:2, SEQ ED NO:3, SEQ 
ID NO:4, SEQ ID NO:5, SEQ ED NO:6, SEQ ED NO:7, SEQ ED NO:8, SEQ ID NO:9, 

20 SEQ ED NO: 1 0, SEQ ED NO: 1 1 , SEQ ED NO: 1 2, SEQ ED NO: 1 3, SEQ ID NO: 1 4, SEQ 
ID NO: 1 5, SEQ ED NO: 1 6, SEQ ED NO: 1 7, SEQ ID NO: 1 8, SEQ ED NO: 1 9, SEQ ID 
NO:20, SEQ ED NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID N0.25, 
SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:28, SEQ ID NO:29, SEQ ID NO:30, SEQ 
ED NO:31, SEQ ED NO:32, SEQ ID NO:33, SEQ ED NO:34, SEQ ID NO:35, SEQ ID 

25 NO:36, SEQ ID NO:37, SEQ ID NO:38, SEQ ID NO:39, SEQ ID NO:40, SEQ ID NO:4 1 , 
SEQ ED NO:42, SEQ ED NO:43, SEQ ID NO:44, SEQ ID NO:45, SEQ ID NO:46, SEQ 
ED NO:47, SEQ ED NO:48, SEQ ED NO:49, SEQ ED NO:50, SEQ ED NO:51, SEQ ID 
NO:52, SEQ ED NO:53, SEQ ED NO:54, SEQ ED NO:55, SEQ ED NO:56, SEQ ED NO:57, 
SEQ ED NO:58, SEQ ED NO:59, SEQ ED NO:60, SEQ ED NO:61, SEQ ED NO:62, SEQ 

30 ED NO:63, SEQ ED NO:64, SEQ ED NO:65, SEQ ED NO:66, SEQ ED NO:67, SEQ ED 

NO:68, SEQ ED NO:69, SEQ ED NO:70, SEQ ED NO:71, SEQ ED NO:72, SEQ ED NO:73, 
SEQ ED NO:74, SEQ ED NO:75, SEQ ED NO:76, SEQ ID NO:77, SEQ ED NO:78, SEQ 
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ID NO:79, SEQ ID NO:80, SEQ ID N0:81, SEQ ID NO:82, SEQ ID NO:83, SEQ ID 
NO:84, SEQ ID NO:85, SEQ ID NO:86, SEQ ID NO:87, SEQ ID NO:88, SEQ ID NO:89, 
SEQ ID NO:90, SEQ ID N0:91, SEQ ID NO:92, SEQ ID NO:93, SEQ ID NO:94, SEQ 
ID NO:95, SEQ ID NO:96, SEQ ID NO:97, SEQ ID NO:98, SEQ ID NO:99, SEQ ID 
NO:100, SEQ ID NO:101, SEQ ID NO:102, SEQ ID NO:103, SEQ ID NO:104, SEQ ID 
NO: 105, SEQ ID NO: 106, SEQ ID NO: 107, SEQ ID NO: 108, SEQ ID NO: 109, SEQ ID 
NO: 1 1 0, SEQ ID NO: 1 1 1 , SEQ ED NO: 1 1 2, SEQ ID NO: 1 1 3, SEQ ID NO: 1 1 4, SEQ ID 
NO: 1 1 5, SEQ ID NO: 1 1 6, SEQ ID NO: 1 1 7, SEQ ID NO: 1 1 8, SEQ ID NO: 1 1 9, SEQ ID 
NO:120, and SEQ ID NO:121, or the corresponding full-length sequence, or a functional 
derivative thereof. 

In preferred embodiments, the nucleic acid probe hybridizes to nucleic acid 
encoding at least 6, 12, 75, 90, 105, 120, 150, 200, 250, 300 or 350 contiguous amino 
acids of a sequence selected from the group consisting of those set forth in SEQ ID 
NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID 
NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID 
NO:132, SEQ ID NO:133, SEQ ID NO: 134, SEQ ID NO:135, SEQ ID NO: 136, SEQ ID 
NO: 1 37, SEQ ID NO: 1 38, SEQ ID NO: 1 39, SEQ ID NO: 1 40, SEQ ID NO: 1 4 1 , SEQ ID 
NO: 1 42, SEQ ID NO: 1 43, SEQ ID NO: 1 44, SEQ ID NO: 1 45, SEQ ID NO: 1 46, SEQ ID 
NO: 1 47, SEQ ID NO: 1 48, SEQ ED NO: 1 49, SEQ ID NO: 1 50, SEQ ID NO: 1 5 1 , SEQ ID 
NO:l 52, SEQ ID NO: 1 53, SEQ ID NO: 1 54, SEQ ID NO: 1 55, SEQ ID NO: 1 56, SEQ ID 
NO: 1 57, SEQ ID NO: 1 58, SEQ ID NO: 1 59, SEQ ID NO: 1 60, SEQ ID NO: 1 61 , SEQ ID 
NO: 162, SEQ ID NO: 163, SEQ ID NO: 164, SEQ ID NO: 165. SEQ ID NO: 166, SEQ ID 
NO: 1 67, SEQ ID NO: 1 68, SEQ ID NO: 1 69, SEQ ID NO: 1 70, SEQ ID NO: 1 7 1 , SEQ ID 
NO:172, SEQ ID NO: 173, SEQ ID NO: 174, SEQ ID NO: 175, SEQ ID NO: 176, SEQ ID 
NO:l 77, SEQ ID NO: 1 78, SEQ ID NO: 1 79, SEQ ID NO: 1 80, SEQ ID NO: 1 8 1 , SEQ ID 
NO: 1 82, SEQ ED NO: 1 83, SEQ ED NO: 1 84, SEQ ID NO: 1 85, SEQ ID NO: 1 86, SEQ ID 
NO: 1 87, SEQ ID NO: 1 88, SEQ ID NO: 1 89, SEQ ID NO: 1 90, SEQ ID NO: 1 91 , SEQ ID 
NO: 199, SEQ ED NO: 193, SEQ ID NO: 194, SEQ ID NO: 195, SEQ ID NO: 196, SEQ ID 
NO:197, SEQ ED NO:198, SEQ ED NO:199, SEQ ED NO:200, SEQ ID NO:201, SEQ ID 
NO:202, SEQ ID NO:203, SEQ ED NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID 
NO:207, SEQ ED NO:208, SEQ ED NO:209, SEQ ID NO:2 1 0, SEQ ID NO:2 1 1 , SEQ ID 
NO:212, SEQEDNO:213, SEQ ID NO:214, SEQEDNO:215, SEQIDNO:216, SEQ ED 
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NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID 
NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID 
NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID 
NO:232 3 SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID 
5 NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241, and SEQ 
ID NO:242, or the corresponding full-length amino acid sequence, or functional 
derivatives thereof 

Methods for using the probes include detecting the presence or amount of kinase 
RNA in a sample by contacting the sample with a nucleic acid probe under conditions 

10 such that hybridization occurs and detecting the presence or amount of the probe bound to 

kinase RNA. The nucleic acid duplex formed between the probe and a nucleic acid 
sequence coding for a kinase polypeptide may be used in the identification of the sequence 
of the nucleic acid detected (Nelson et ai 9 in Nonisotopic DNA Probe Techniques, 
Academic Press, San Diego, Kricka, ed., p. 275, 1992, hereby incorporated by reference 

15 herein in its entirety, including any drawings, figures, or tables). Kits for performing such 

methods may be constructed to include a container means having disposed therein a 
nucleic acid probe. 

In a third aspect, the invention describes a recombinant cell or tissue comprising a 
nucleic acid molecule encoding a kinase polypeptide selected from the group consisting of 



20 



SEQ 


ID 


NO:122, 


SEQ 


ED 


NO: 123, 


SEQ 


ID 


NO: 124, 


SEQ 


ID 


NO:125, 


SEQ ID 


NO: 126, 


SEQ 


ID 


NO:127, 


SEQ 


ID 


NO:128, 


SEQ 


ID 


NO:129, 


SEQ 


ID 


NO: 130, 


SEQ ID 


NO:131, 


SEQ 


ID 


NO:132, 


SEQ 


ID 


NO:133, 


SEQ 


ID 


NO: 134, 


SEQ 


ID 


NO:135, 


SEQ ID 


NO:136, 


SEQ 


ID 


NO:137, 


SEQ 


ID 


NO:138, 


SEQ 


ID 


NO:139, 


SEQ 


ID 


NO: 140, 


SEQ ID 


NO:141, 


SEQ 


ID 


NO:142, 


SEQ 


ID 


NO:143, 


SEQ 


ID 


NO: 144, 


SEQ 


ED 


NO:145, 


SEQ ID 


NO:146, 


SEQ 


ID 


NO:147, 


SEQ 


ID 


NO:148, 


SEQ 


ID 


NO: 149, 


SEQ 


ED 


NO: 150, 


SEQ ID 


NO:151, 


SEQ 


ID 


NO:152, 


SEQ 


ID 


NO:153, 


SEQ 


ID 


NO: 154, 


SEQ 


ID 


NO:l 55, 


SEQ ED 


NO: 156, 


SEQ 


ID 


NO:157, 


SEQ 


ID 


NO: 158, 


SEQ 


ID 


NO: 159, 


SEQ 


ID 


NO: 160, 


SEQ ED 


NO:161, 


SEQ 


ID 


NO: 162, 


SEQ 


ID 


NO:163, 


SEQ 


ED 


NO: 164, 


SEQ 


ED 


NO: 165. 


SEQ ED 


NO: 166, 


SEQ 


ID 


NO: 167, 


SEQ 


ID 


NO:168, 


SEQ 


ID 


NO: 169, 


SEQ 


ED 


NO: 170, 


SEQ ID 


NO:171, 


SEQ 


ID 


NO: 172, 


SEQ 


ID 


NO: 173, 


SEQ 


ED 


NO: 174, 


SEQ 


ED 


NO:175, 


SEQ ID 


NO: 176, 


SEQ 


ID 


NO:177, 


SEQ 


ID 


NO:178, 


SEQ 


ID 


NO: 179, 


SEQ 


ID 


NO: 180, 


SEQ ID 


NO:181, 


SEQ 


ID 


NO: 182, 


SEQ 


ID 


NO:183, 


SEQ 


ID 


NO: 184, 


SEQ 


ID 


NO: 185, 


SEQ ID 


NO:186, 



WO 00/73469 



41 



PCT/US00/14842 



SEQ ID NO: 1 87, SEQ ID NO: 1 88, SEQ ID NO: 1 89, SEQ ID NO: 1 90, SEQ ID NO: 1 91 , 
SEQ ID NO: 199, SEQ ID NO: 193, SEQ ID NO: 194, SEQ ID NO:195, SEQ ID NO: 196, 
SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201, 
SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, 
SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:21 1, 
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, 
SEQ ED NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221 , 
SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, 
SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, 
SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, 
SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241 , 
and SEQ ID NO:242. In such cells, the nucleic acid may be under the control of the 
genomic regulatory elements, or may be under the control of exogenous regulatory 
elements including an exogenous promoter. By "exogenous" it is meant a promoter that is 
not normally coupled in vivo transcriptionally to the coding sequence for the kinase 
polypeptides. 

The polypeptide is preferably a fragment of the protein encoded by an amino acid 
sequence selected from the group consisting of those set forth in SEQ ID NO:122, SEQ ID 
NO: 123, SEQ ID NO: 124, SEQ ID NO: 125, SEQ ID NO: 126, SEQ ID NO: 127, SEQ ID 
NO: 1 28, SEQ ID NO: 1 29, SEQ ID NO: 1 30, SEQ ID NO: 1 3 1 , SEQ ID NO: 1 32, SEQ ID 
NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID NO:137, SEQ ID 
NO: 138, SEQ ID NO: 139, SEQ ID NO: 140, SEQ ID NO: 141, SEQ ID NO: 142, SEQ ID 
NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146, SEQ ID NO:147, SEQ ID 
NO: 1 48, SEQ ID NO: 1 49, SEQ ID NO: 1 50, SEQ ID NO: 1 5 1 , SEQ ID NO: 1 52, SEQ ID 
NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156, SEQ ID NO:157, SEQ ID 
NO: 158, SEQ ID NO: 159, SEQ ID NO: 160, SEQ ID NO: 161, SEQ ID NO: 162, SEQ ID 
NO: 163, SEQ ID NO: 164, SEQ ED NO: 165. SEQ ED NO: 166, SEQ ID NO: 167, SEQ ID 
NO: 168, SEQ ED NO: 169, SEQ ID NO: 170, SEQ IDNO:171, SEQ ID NO: 172, SEQ ID 
NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176, SEQ ID NO:177, SEQ ID 
NO: 1 78, SEQ ID NO: 1 79, SEQ ID NO: 1 80, SEQ ID NO: 1 8 1 , SEQ ID NO: 1 82, SEQ ID 
NO: 183, SEQ ID NO: 184, SEQ ID NO:185, SEQ ID NO:186, SEQ ID NO:187, SEQ ID 
NO:188, SEQ ID NO:189, SEQ ED NO:190, SEQ ID NO:191, SEQ ID NO:199, SEQ ID 
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NO: 193, SEQ ID NO: 194, SEQ ID NO: 195, SEQ ID NO: 196, SEQ ID NO: 197, SEQ ID 
NO: 198, SEQ ID NO: 199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID 
NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID 
NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:21 1, SEQ ID NO:212, SEQ ID 
5 NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID 
NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID 
NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID 
NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231 , SEQ ID NO:232, SEQ ID 
NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID 

1 0 NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241 , and SEQ ID NO:242, or the 
corresponding full-length amino acid sequence. By "fragment," is meant an amino acid 
sequence present in a kinase polypeptide. Preferably, such a sequence comprises at least 
10, 20, 40, 50, 75, 100, 200, or 300 contiguous amino acids a sequence selected from the 
group consisting of those set forth in SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, 

15 SEQ ID NO: 125, SEQ ID NO: 126, SEQ ID NO: 127, SEQ ID NO: 128, SEQ ID NO: 129, 
SEQ ID NO: 130, SEQ ID NO:131, SEQ ID NO: 132, SEQ ID NO: 133, SEQ ID NO: 134, 
SEQ ID NO: 135, SEQ ID NO: 136, SEQ ID NO: 137, SEQ ID NO: 138, SEQ ID NO: 139, 
SEQ ID NO:140, SEQ ID NO:141, SEQ ID NO:142, SEQ ED NO:143, SEQ ID NO:144, 
SEQ ID NO: 145, SEQ ID NO: 146, SEQ ID NO: 147, SEQ ID NO: 148, SEQ ID NO: 149, 

20 SEQ ID NO: 1 50, SEQ ID NO: 1 5 1 , SEQ ID NO: 1 52, SEQ ID NO: 1 53, SEQ ID NO: 1 54, 
SEQ ID NO: 1 55, SEQ ID NO: 1 56, SEQ ID NO: 1 57, SEQ ID NO: 1 58, SEQ ID NO: 1 59, 
SEQ ID NO: 160, SEQ ID NO: 161, SEQ ID NO: 162, SEQ ID NO: 163, SEQ ID NO: 164, 
SEQ ID NO: 1 65. SEQ ID NO: 1 66, SEQ ID NO:l 67, SEQ ID NO: 1 68, SEQ ID NO: 1 69, 
SEQ ID NO: 170, SEQ ID NO: 171, SEQ ID NO: 172, SEQ ID NO: 173, SEQ ID NO: 174, 

25 SEQ ID NO: 1 75, SEQ ID NO: 1 76, SEQ ID NO: 1 77, SEQ ID NO: 1 78, SEQ ID NO: 1 79, 
SEQ ID NO: 180, SEQ ID NO: 181, SEQ ID NO: 182, SEQ ID NO: 183, SEQ ID NO: 184, 
SEQ ID NO: 1 85, SEQ ID NO: 1 86, SEQ ID NO: 1 87, SEQ ID NO: 1 88, SEQ ID NO: 1 89, 
SEQ ID NO: 190, SEQ ID NO: 191, SEQ ID NO: 199, SEQ ID NO: 193, SEQ ID NO: 194, 
SEQ ID NO: 195, SEQ ID NO: 196, SEQ ID NO: 197, SEQ ID NO: 198, SEQ ID NO: 199, 

30 SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, 
SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, 
SEQ ID NO:210, SEQ ID NO:21 1, SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, 
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SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID N0:21 8, SEQ ID NO:2l9, 
SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, 
SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, 
SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, 
SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, 
SEQ ED NO:240, SEQ ID NO:241, and SEQ ID NO:242, or of the corresponding full- 
length amino acid sequence, or a functional derivative thereof. 

In a fourth aspect, the invention features an isolated, enriched, or purified kinase 
polypeptide selected from the group consisting of SEQ ID NO:122, SEQ ID NO:123, SEQ 
ID NO: 1 24, SEQ ID NO: 1 25, SEQ ID NO: 1 26, SEQ ID NO: 1 27, SEQ ID NO: 1 28, SEQ 
ID NO: 1 29, SEQ ID NO: 1 30, SEQ ID NO: 1 3 1 , SEQ ID NO: 1 32, SEQ ID NO: 1 33 , SEQ 
ID NO: 134, SEQ ID NO: 135, SEQ ED NO: 136, SEQ ED NO: 137, SEQ ID NO: 13 8, SEQ 
ID NO: 1 39, SEQ ID NO: 1 40, SEQ ED NO: 1 4 1 , SEQ ED NO: 1 42, SEQ ED NO: 1 43 , SEQ 
ID NO:l 44, SEQ ED NO: 145, SEQ ED NO: 1 46, SEQ ED NO: 1 47, SEQ ED NO: 1 48, SEQ 
ID NO: 1 49, SEQ ED NO: 1 50, SEQ ED NO: 1 5 1 , SEQ ED NO:l 52, SEQ ED NO: 1 53, SEQ 
ID NO: 154, SEQ ED NO: 155, SEQ ED NO: 156, SEQ ED NO: 157, SEQ ED NO: 158, SEQ 
ID NO: 1 59, SEQ ED NO: 1 60, SEQ ED NO: 1 61 , SEQ ED NO: 1 62, SEQ ID NO: 1 63, SEQ 
ID NO: 1 64, SEQ ED NO: 1 65. SEQ ED NO: 1 66, SEQ ID NO: 1 67, SEQ ID NO: 1 68, SEQ 
ID NO: 1 69, SEQ ED NO: 1 70, SEQ ED NO: 1 71 , SEQ ID NO: 1 72, SEQ ID NO: 1 73, SEQ 
ID NO: 1 74, SEQ ED NO: 1 75, SEQ ED NO: 1 76, SEQ ID NO: 1 77, SEQ ED NO: 1 78, SEQ 
ID NO: 1 79, SEQ ED NO: 1 80, SEQ ID NO: 1 8 1 , SEQ ID NO: 1 82, SEQ ID NO: 1 83 , SEQ 
ID NO: 1 84, SEQ ID NO: 1 85, SEQ ID NO: 1 86, SEQ ID NO: 1 87, SEQ ID NO: 1 88, SEQ 
ID NO: 189, SEQ ED NO: 190, SEQ ID NO: 191, SEQ ID NO: 199, SEQ ID NO: 193, SEQ 
ID NO:194, SEQ ED NO:195, SEQ ID NO:196, SEQ ED NO:197, SEQ ID NO:198, SEQ 
ID NO: 199, SEQ ID NO:200, SEQ ED NO:201, SEQ ED NO:202, SEQ ID NO:203, SEQ 
ID NO:204, SEQ ED NO:205, SEQ ED NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ 
ID NO:209, SEQ ED NO.210, SEQ ED NO:21 1 , SEQ ED NO:212, SEQ ED NO:21 3, SEQ 
ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ED NO:217, SEQ ED NO:218, SEQ 
ID NO:219, SEQ ED NO:220, SEQ ID NO:221 , SEQ ED NO:222, SEQ ED NO:223, SEQ 
ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ED NO:227, SEQ ED NO:228, SEQ 
ID NO:229, SEQ ED NO:230, SEQ ID NO:231 , SEQ ED NO:232, SEQ ID NO:233, SEQ 
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ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ED NO:237, SEQ ID NO:238, SEQ 
ID NO:239, SEQ ID NO:240, SEQ ID NO:241 , and SEQ ID NO:242. 

By "isolated" in reference to a polypeptide is meant a polymer of amino acids (2 or 
more amino acids) conjugated to each other, including polypeptides that are isolated from 
5 a natural source or that are synthesized. The isolated polypeptides of the present invention 

are unique in the sense that they are not found in a pure or separated state in nature. Use 
of the term "isolated" indicates that a naturally occurring sequence has been removed from 
its normal cellular environment. Thus, the sequence may be in a cell-free solution or 
placed in a different cellular environment. The term does not imply that the sequence is 

1 0 the only amino acid chain present, but that it is essentially free (about 90 - 95% pure at 
least) of non-amino acid material naturally associated with it. 

By the use of the term "enriched" in reference to a polypeptide is meant that the 
specific amino acid sequence constitutes a significantly higher fraction (2 - 5 fold) of the 
total amino acid sequences present in the cells or solution of interest than in normal or 

15 diseased cells or in the cells from which the sequence was taken. This could be caused by 

a person by preferential reduction in the amount of other amino acid sequences present, or 
by a preferential increase in the amount of the specific amino acid sequence of interest, or 
by a combination of the two. However, it should be noted that enriched does not imply 
that there are no other amino acid sequences present, just that the relative amount of the 

20 sequence of interest has been significantly increased. The term significant here is used to 

indicate that the level of increase is useful to the person making such an increase, and 
generally means an increase relative to other amino acid sequences of about at least 2-fold, 
more preferably at least 5- to 10-fold or even more. The term also does not imply that 
there is no amino acid sequence from other sources. The other source of amino acid 

25 sequences may, for example, comprise amino acid sequence encoded by a yeast or 

bacterial genome, or a cloning vector such as pUC19. The term is meant to cover only 
those situations in which man has intervened to increase the proportion of the desired 
amino acid sequence. 

It is also advantageous for some purposes that an amino acid sequence be in 

30 purified form. The term "purified" in reference to a polypeptide does not require absolute 
purity (such as a homogeneous preparation); instead, it represents an indication that the 
sequence is relatively purer than in the natural environment. Compared to the natural level 
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this level should be at least 2-5 fold greater (e.g., in terms of mg/mL). Purification of at 
least one order of magnitude, preferably two or three orders, and more preferably four or 
five orders of magnitude is expressly contemplated. The substance is preferably free of 
contamination at a functionally significant level, for example 90%, 95%, or 99% pure. 

5 In preferred embodiments, the kinase polypeptide is a fragment of the protein 

encoded by an amino acid sequence selected from the group consisting of those set forth in 
SEQ ID NO: 122, SEQ ID NO: 123, SEQ ID NO: 124, SEQ ID NO: 125, SEQ ID NO: 126, 
SEQ ID NO:127, SEQ ID NO: 128, SEQ ID NO:129, SEQ ID NO: 130, SEQ ID NO:131, 
SEQ ID NO:132, SEQ ID NO: 133, SEQ ID NO: 134, SEQ ID NO: 135, SEQ ID NO: 136, 

0 SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141, 
SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146, 
SEQ ID NO: 1 47, SEQ ID NO: 1 48, SEQ ID NO: 1 49, SEQ ID NO: 1 50, SEQ ID NO: 1 5 1 , 
SEQ ID NO: 152, SEQ ID NO: 153, SEQ ID NO: 154, SEQ ID NO: 155, SEQ ID NO:l 56, 
SEQ ID NO: 1 57, SEQ ID NO: 1 58, SEQ ID NO: 1 59, SEQ ID NO: 1 60, SEQ ID NO: 1 6 1 , 

5 SEQ ID NO: 162, SEQ ID NO: 163, SEQ ID NO: 164, SEQ ID NO: 165. SEQ ID NO: 166, 
SEQ ID NO: 1 67, SEQ ID NO: 1 68, SEQ ID NO: 1 69, SEQ ID NO : 1 70, SEQ ID NO: 1 7 1 , 
SEQ ID NO:l 72, SEQ ID NO: 1 73, SEQ ID NO:l 74, SEQ ID NO: 1 75, SEQ ID NO: 1 76, 
SEQ ID NO: 1 77, SEQ ID NO: 1 78, SEQ ID NO: 1 79, SEQ ID NO: 1 80, SEQ ID NO: 1 8 1 , 
SEQ ID NO:182, SEQ ID NO: 183, SEQ ID NO:184, SEQ ID NO: 185, SEQ ID NO: 186, 

!0 SEQ ID NO: 187, SEQ ID NO: 188, SEQ ID NO: 189, SEQ ID NO: 190, SEQ ID NO:191, 

SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196, 
SEQ ID NO: 197, SEQ ID NO: 198, SEQ ID NO: 199, SEQ ID NO:200, SEQ ID NO:201, 
SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, 
SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:21 0, SEQ ID NO:21 1 , 

>5 SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ED NO:216, 
SEQ ID NO:217, SEQ ED NO:218, SEQ ED NO:219, SEQ ED NO:220, SEQ ED NO:221, 
SEQ ID NO:222, SEQ ED NO:223, SEQ ED NO:224, SEQ ED NO:225, SEQ ID NO:226, 
SEQ ED NO:227, SEQ ED NO:228, SEQ ED NO:229, SEQ ED NO:230, SEQ ED NO:231, 
SEQ ED NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ED NO:235, SEQ ID NO:236, 

J0 SEQ ID NO:237, SEQ ED NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241 , 

and SEQ ID NO:242, or the corresponding full-length amino acid sequences. Preferably, 
the kinase polypeptide contains at least 10, 20, 40, 50, 75, 100, 200, or 300 contiguous 
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amino acids a sequence selected from the group consisting of those set forth in SEQ ID 
NO: 122, SEQ ID NO: 123, SEQ ID NO: 124, SEQ ID NO: 125, SEQ ID NO: 126, SEQ ID 
NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ED NO:131, SEQ ID 
NO:132, SEQ ID NO: 133, SEQ ID NO: 134, SEQ ID NO: 135, SEQ ID NO:136, SEQ ID 
NO: 137, SEQ ID NO: 138, SEQ ID NO: 139, SEQ ID NO: 140, SEQ ID NO:141, SEQ ID 
NO: 142, SEQ ID NO: 143, SEQ ID NO: 144, SEQ ID NO: 145, SEQ ID NO: 146, SEQ ID 
NO: 147, SEQ ID NO: 148, SEQ ID NO: 149, SEQ ID NO: 150, SEQ ID NO: 151, SEQ ID 
NO: 152, SEQ ID NO: 153, SEQ ID NO: 154, SEQ ID NO: 155, SEQ ID NO: 156, SEQ ID 
NO: 157, SEQ ID NO: 158, SEQ ID NO:159, SEQ ID NO: 160, SEQ ID NO:161, SEQ ID 
NO: 1 62, SEQ ID NO: 1 63, SEQ ID NO: 1 64, SEQ ID NO: 1 65. SEQ ID NO: 1 66, SEQ ID 
NO: 1 67, SEQ ID NO: 1 68, SEQ ID NO: 1 69, SEQ ID NO: 1 70, SEQ ID NO: 1 7 1 , SEQ ID 
NO: 172, SEQ ID NO: 173, SEQ ID NO: 174, SEQ ID NO: 175, SEQ ID NO: 176, SEQ ID 
NO: 1 77, SEQ ID NO: 1 78, SEQ ID NO: 1 79, SEQ ID NO: 1 80, SEQ ID NO: 1 8 1 , SEQ ID 
NO: 1 82, SEQ ID NO: 1 83, SEQ ID NO: 1 84, SEQ ID NO: 1 85, SEQ ID NO: 1 86, SEQ ID 
NO: 1 87, SEQ ID NO: 1 88, SEQ ID NO: 1 89, SEQ ID NO: 1 90, SEQ ID NO: 1 9 1 , SEQ ED 
NO: 199, SEQ ID NO: 193, SEQ ED NO: 194, SEQ TD NO: 195, SEQ ED NO: 196, SEQ ED 
NO: 197, SEQ ED NO: 198, SEQ ED NO: 199, SEQ ED NO:200, SEQ ED NO:201, SEQ ID 
NO:202, SEQ ED NO:203, SEQ ED NO:204, SEQ ED NO:205, SEQ ED NO:206, SEQ ID 
NO:207, SEQ 3D NO:208, SEQ ID NO:209, SEQ ED NO:210, SEQ ID NO:21 1, SEQ ID 
NO:212, SEQ ED NO:213, SEQ ED NO:214, SEQ ED NO:215, SEQ ID NO:216, SEQ ID 
NO:2 1 7, SEQ ID NO:2 1 8, SEQ ED NO:2 1 9, SEQ ID NO:220, SEQ ID NO:22 1 , SEQ ID 
NO:222, SEQ ED NO:223, SEQ ED NO:224, SEQ ED NO:225, SEQ ED NO:226, SEQ ID 
NO:227, SEQ ID NO:228, SEQ ED NO:229, SEQ ED NO:230, SEQ ID NO:23 1 , SEQ ID 
NO:232, SEQ ED NO:233, SEQ ED NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID 
NO:237, SEQ ED NO:238, SEQ ED NO:239, SEQ ID NO:240, SEQ ED NO:241, and SEQ 
ID NO:242, or the corresponding full-length amino acid sequence, or a functional 
derivative thereof. 

In preferred embodiments, the kinase polypeptide comprises an amino acid 
sequence having (a) an amino acid sequence selected from the group consisting of those 
set forth in SEQ ED NO:122, SEQ ID NO:123, SEQ ED NO:124, SEQ ED NO:125, SEQ 
ID NO:126, SEQ ED NO:127, SEQ ED NO:128, SEQ ED NO:129, SEQ ID NO:130, SEQ 
ID NO:131, SEQ IDNO:132, SEQ ED NO: 133, SEQ ED NO: 134, SEQ ID NO: 135, SEQ 
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ED NO:241, and SEQ ED NO:242; (b) an amino acid sequence selected from the group 
consisting of those set forth in SEQ ED NO:122, SEQ ED NO:123, SEQ ID NO:124, SEQ 
ED NO: 125, SEQ ED NO: 126, SEQ ED NO: 127, SEQ ID NO: 128, SEQ ID NO: 129, SEQ 

25 ID NO: 130, SEQ ED NO: 131, SEQ ED NO: 132, SEQ ED NO: 133, SEQ ID NO: 134, SEQ 

ID NO:135, SEQ ID NO: 136, SEQ ID NO:137, SEQ ID NO: 138, SEQ ID NO:139, SEQ 
IDNO:140, SEQ EDNO:141, SEQ ID NO:142, SEQ ID NO: 143, SEQ ID NO:144, SEQ 
ED NO:145, SEQ ED NO:146, SEQ ED NO:147, SEQ ID NO:148, SEQ ED NO:149, SEQ 
ID NO: 1 50, SEQ ED NO: 1 5 1 , SEQ ID NO: 1 52, SEQ ED NO: 1 53, SEQ ID NO: 1 54, SEQ 

30 ED NO: 155, SEQ ED NO: 156, SEQ ED NO: 157, SEQ ED NO: 158, SEQ ID NO: 159, SEQ 
ED NO:l 60, SEQ ED NO: 1 61 , SEQ ED NO: 1 62, SEQ ED NO: 1 63, SEQ ID NO: 1 64, SEQ 
EDNO:165. SEQ ED NO: 166, SEQ ED NO:167, SEQ ED NO: 168, SEQ ED NO: 169, SEQ 
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ID NO: 1 70, SEQ ID NO: 1 7 1 , SEQ ID NO: 1 72, SEQ ID NO: 1 73, SEQ ID NO: 1 74, SEQ 
ID NO:175, SEQ ID NO:176, SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ 
ID NO: 1 80, SEQ ID NO: 1 8 1 , SEQ ID NO: 1 82, SEQ ID NO: 1 83, SEQ ID NO: 1 84, SEQ 
ID NO:185, SEQ ID NO: 186, SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ 
ID NO: 190, SEQ IDNO:191, SEQ ID NO: 199, SEQ ID NO: 193, SEQ ID NO: 194, SEQ 
ID NO: 195, SEQ ID NO: 196, SEQ ID NO: 197, SEQ ID NO: 198, SEQ ID NO: 199, SEQ 
ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ 
ID NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ 
ID NO:210, SEQ ID NO:21 1, SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ 
ID NO:215, SEQ ED NO:21 6, SEQ ID NO:217, SEQ ID NO:21 8, SEQ ID NO:219, SEQ 
ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ 
ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ 
ID NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ 
ID NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ 
ID NO:240, SEQ ID NO:241, and SEQ ID NO:242, except that it lacks one or more, but 
not all, of a domain selected from the group consisting of an N-terminal domain, a 
catalytic domain, a C-terminal domain, a coiled-coil structure region, a proline-rich region, 
a spacer region, an insert, and a C-terminal tail; (c) an amino acid sequence of a domain of 
a polypeptide selected from the group consisting of those set forth in SEQ ID NO: 122, 
SEQ ID NO: 123, SEQ ID NO: 124, SEQ ID NO:125, SEQ ID NO: 126, SEQ ID NO: 127, 
SEQ ID NO: 128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID NO: 132, 
SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID NO:137, 
SEQ ID NO: 138, SEQ ID NO: 139, SEQ ID NO: 140, SEQ ID NO: 141, SEQ ID NO: 142, 
SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146, SEQ ID NO:147, 
SEQ ID NO: 1 48, SEQ ID NO: 1 49, SEQ ID NO: 1 50, SEQ ID NO: 1 5 1 , SEQ ID NO: 1 52, 
SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156, SEQ ID NO:157, 
SEQ ID NO: 158, SEQ ID NO: 159, SEQ ID NO: 160, SEQ ID NO: 161, SEQ ID NO: 162, 
SEQ ID NO: 163, SEQ ID NO: 164, SEQ ID NO: 165. SEQ ID NO: 166, SEQ ID NO: 167, 
SEQ ID NO: 1 68, SEQ ID NO: 1 69, SEQ ID NO: 1 70, SEQ ID NO : 1 7 1 , SEQ ID NO: 1 72, 
SEQ ID NO:173, SEQ ID NO: 174, SEQ ID NO:175, SEQ ID NO:176, SEQ ID NO:177, 
SEQ ID NO: 1 78, SEQ ID NO: 1 79, SEQ ID NO: 1 80, SEQ ID NO: 1 8 1 , SEQ ID NO: 1 82, 
SEQ ID NO: 1 83, SEQ ID NO: 1 84, SEQ ID NO: 1 85, SEQ ID NO: 1 86, SEQ ID NO: 187, 
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SEQ ID NO: 1 88, SEQ ID NO:l 89, SEQ ID NO: 1 90, SEQ ID NO: 1 9 1 , SEQ ID NO: 1 99, 
SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196, SEQ ID N0.197, 
SEQ ID NO: 198, SEQ ID NO:l 99, SEQ ID NO:200, SEQ ID NO:201 , SEQ ID NO:202, 
SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207, 
SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:21 1, SEQ ID NO:212, 
SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217, 
SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, 
SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, 
SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, 
SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237, 
SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241 , and SEQ ID 
N0.242 where the domain is selected from the group consisting of an N-terminal domain, 
a catalytic domain, a C-terminal domain, a coiled-coil structure region, a proline-rich 
region, a spacer region, an insert, and a C-terminal tail; or (d) an amino acid sequence 
selected from the group consisting of those set forth in SEQ ID NO:122, SEQ ID N0:123, 
SEQ ID NO: 1 24, SEQ ID NO: 1 25, SEQ ID NO: 1 26, SEQ ED NO: 1 27, SEQ ID NO: 1 28, 
SEQ ID NO: 129, SEQ ID NO: 130, SEQ ED NO: 131, SEQ ID NO: 132, SEQ ED NO: 133, 
SEQ ID N0:134, SEQ ID NO:135, SEQ ED NO:136, SEQ ID NO:137, SEQ ID NO:138, 
SEQ ID NO: 139, SEQ ED NO: 140, SEQ ID NO:141, SEQ ID NO: 142, SEQ ID NO: 143, 
SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146, SEQ ID NO:147, SEQ ID NO:148, 
SEQ ID NO: 149, SEQ ED NO: 150, SEQ ID NO:151, SEQ ID NO: 152, SEQ ID NO: 153, 
SEQ ID NO: 154, SEQ ID NO: 155, SEQ ID NO: 156, SEQ ID NO: 157, SEQ ID NO: 158, 
SEQ ID NO:159, SEQ ED NO:160, SEQ ED NO:161, SEQ ID NO:162, SEQ ID NO:163, 
SEQ ID NO: 164, SEQ ED NO: 165. SEQ ID NO: 166, SEQ ID NO: 167, SEQ ID NO: 168, 
SEQ ID NO:169, SEQ ID NO: 170, SEQ ID NO: 171, SEQ ID NO: 172, SEQ ID NO: 173, 
SEQ ID NO: 174, SEQ ED NO: 175, SEQ ID NO: 176, SEQ ID NO: 177, SEQ ID NO: 178, 
SEQ ID NO: 1 79, SEQ ID NO: 1 80, SEQ ID NO: 1 8 1 , SEQ ID NO: 1 82, SEQ ID NO : 1 83 , 
SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186, SEQ ID NO:187, SEQ ED NO:188, 
SEQ ID N0:189, SEQ ID NO:190, SEQ ID NO:191, SEQ ID NO:199, SEQ ID NO:193, 
SEQ ID NO: 194, SEQ ED NO: 195, SEQ ID NO: 196, SEQ ID NO: 197, SEQ ID NO: 198, 
SEQ ID N0:199, SEQ ID NO:200, SEQ ED NO:201, SEQ ID NO:202, SEQ ID NO:203, 
SEQ ED NO:204, SEQ ED NO:205, SEQ ID NO:206, SEQ ED NO:207, SEQ ED NO:208, 
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SEQ ID NO.209, SEQ ID NO:210, SEQ ID N0:21 1, SEQ ID NO:212, SEQ ID NO:213, 
SEQ ED NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID NO:218, 
SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223, 
SEQ ID NO:224, SEQ ED NO:225, SEQ ED NO:226, SEQ ID NO:227, SEQ ID NO:228, 
SEQ ED NO:229, SEQ ED NO:230, SEQ ED NO:231 , SEQ ID NO:232, SEQ ID NO:233, 
SEQ ED NO:234, SEQ ED NO:235, SEQ ED NO:236, SEQ ED NO:237, SEQ ID NO:238, 
SEQ ID NO:239, SEQ ED NO:240, SEQ ED NO:241 , and SEQ ED NO:242, except that it 
lacks one or more, but not all, of the domains selected from the group consisting of a C- 
terminal domain, a catalytic domain, an N-terminal domain, a spacer region, a proline-nch 
region, a coiled-coil structure region, an insert, and a C-terminal tail. (The domain 
demarcations of the polypeptides of the invention are indicated in Table 2 by reference to 
the kinase domain.) 

The polypeptide can be isolated from a natural source by methods well-known in 
the art. The natural source may be mammalian, preferably human, blood, semen, or tissue, 
and the polypeptide may be synthesized using an automated polypeptide synthesizer. The 
isolated, enriched, or purified kinase polypeptide is preferably selected from the group 
consisting of those set forth in SEQ ID NO: 122, SEQ ID NO: 123, SEQ ID NO: 124, SEQ 
ED NO:125, SEQ ED NO: 126, SEQ ID NO: 127, SEQ ID NO: 128, SEQ ID NO: 129, SEQ 
ID NO:130, SEQ ED NO:131, SEQ EDNO:132, SEQ ED NO:133, SEQ ID NO:134, SEQ 
ED NO:135, SEQ ED NO:136, SEQ ED NO:137, SEQ ED NO:138, SEQ ID NO:139, SEQ 
ED NO: 140, SEQ ED NO: 141, SEQ ED NO: 142, SEQ ED NO: 143, SEQ ED NO: 144, SEQ 
ED NO:145, SEQ ID NO:146, SEQ ED NO:147, SEQ ED NO:148, SEQ ED NO:149, SEQ 
ED NO:150, SEQ ED NO:151, SEQ ED NO:152, SEQ ED NO:153, SEQ ID NO:154, SEQ 
ID NO: 155, SEQ ED NO:156, SEQ ED NO: 157, SEQ ED NO: 158, SEQ ID NO: 159, SEQ 
ID NO: 160, SEQ ED NO: 161, SEQ ED NO: 162, SEQ ID NO: 163, SEQ ID NO: 164, SEQ 
ID NO: 165. SEQ ID NO: 166, SEQ ID NO: 167, SEQ ID NO: 168, SEQ ID NO: 169, SEQ 
ID NO:170, SEQ ED NO:171, SEQ ED NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ 
ID NO: 175, SEQ ID NO: 176, SEQ ID NO: 177, SEQ ID NO: 178, SEQ ID NO: 179, SEQ 
ID NO: 180, SEQ ID NO:181, SEQ ID NO: 182, SEQ ID NO: 183, SEQ ID NO: 184, SEQ 
ID NO: 185, SEQ ED NO: 186, SEQ ID NO: 187, SEQ ID NO: 188, SEQ ID NO: 189, SEQ 
IDNO:190, SEQ ID NO:191, SEQ ED NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ 
ED NO: 195, SEQ ED NO: 196, SEQ ED NO: 197, SEQ ED NO: 198, SEQ ID NO: 199, SEQ 
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ID NO:200, SEQ ID NO:201, SEQ ID NO.202, SEQ ID NO:203, SEQ ID NO:204, SEQ 
ID NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ 
ID NO:210, SEQ ID N0:21 1, SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ 
ID NO:215, SEQ IDNO:216, SEQ IDNO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ 

5 ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ 
ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ 
ID NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ 
ID NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ 
ID NO:240, SEQ ID NO:241, and SEQ ID NO:242A. 

10 In some embodiments the invention includes a recombinant kinase polypeptide 

selected from the group consisting of SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, 
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SEQ ID NO:230, SEQ ID NO:23 1 , SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, 
SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, 
SEQ ID NO:240, SEQ ID NO:241, and SEQ ID NO:242. By "recombinant kinase 
polypeptide" is meant a polypeptide produced by recombinant DNA techniques such that 
5 it is distinct from a naturally occurring polypeptide either in its location (e.g., present in a 

different cell or tissue than found in nature), purity or structure. Generally, such a 
recombinant polypeptide will be present in a cell in an amount different from that 
normally observed in nature. 

Jn-a fifth aspect, the invention features an antibody (eg., a monoclonal or 

10 polyclonal antibody) having specific binding affinity to a kinase polypeptide or a kinase 

polypeptide domain or fragment where the polypeptide is selected from the group 
consisting of SEQ ID NO:122, SEQ ID NO:123 5 SEQ ID NO:124, SEQ ID NO:125, SEQ 
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ID NO:226, SEQ ID NO:227 5 SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ 
ID NO:231 , SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ 
ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ 
ID NO:241 , and SEQ ID NO:242. In preferred embodiments, the antibody binds 

5 specifically to domains of kinase polypeptides, that are defined supra. 

By "specific binding affinity" is meant that the antibody binds to the target kinase 
polypeptide with greater affinity than it binds to other polypeptides under specified 
conditions. Antibodies or antibody fragments are polypeptides that contain regions that 
can bind other polypeptides. The term "specific binding affinity" describes an antibody 

10 that binds to a kinase polypeptide with greater affinity than it binds to other polypeptides 

under specified conditions. 

The term "polyclonal" refers to antibodies that are heterogenous populations of 
antibody molecules derived from the sera of animals immunized with an antigen or an 
antigenic functional derivative thereof. For the production of polyclonal antibodies, 

15 various host animals may be immunized by injection with the antigen. Various adjuvants 

may be used to increase the immunological response, depending on the host species. 

"Monoclonal antibodies" are substantially homogenous populations of antibodies 
to a particular antigen. They may be obtained by any technique which provides for the 
production of antibody molecules by continuous cell lines in culture. Monoclonal 

20 antibodies may be obtained by methods known to those skilled in the art (Kohler ei al. 9 

Nature 256:495-497, 1975, and U.S. Patent No. 4,376,1 10, both of which are hereby 
incorporated by reference herein in their entirety including any figures, tables, or 
drawings). 

The term "antibody fragment" refers to a portion of an antibody, often the hyper 
25 variable region and portions of the surrounding heavy and light chains, that displays 

specific binding affinity for a particular molecule. A hyper variable region is a portion of 
an antibody that physically binds to the polypeptide target. 

Antibodies or antibody fragments having specific binding affinity to a kinase 
polypeptide or domains of a kinase polypeptide of the invention may be used in methods 
30 for detecting the presence and/or amount of kinase polypeptide in a sample by probing the 

sample with the antibody under conditions suitable for kinase-antibody immunocomplex 
formation and detecting the presence and/or amount of the antibody conjugated to the 
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kinase polypeptide. Diagnostic kits for performing such methods may be constructed to 
include antibodies or antibody fragments specific for the kinase as well as a conjugate of a 
binding partner of the antibodies or the antibodies themselves. 

An antibody or antibody fragment with specific binding affinity to a kinase 
5 polypeptide of the invention can be isolated, enriched, or purified from a prokaryotic or 

eukaryotic organism. Routine methods known to those skilled in the art enable production 
of antibodies or antibody fragments, in both prokaryotic and eukaryotic organisms. 
Purification, enrichment, and isolation of antibodies, which are polypeptide molecules, are 
described above. 

10 Antibodies having specific binding affinity to a kinase polypeptide of the invention 

maybe used in methods for detecting the presence and/or amount of kinase polypeptide in 
a sample by contacting the sample with the antibody under conditions such that an 
immunocomplex forms and detecting the presence and/or amount of the antibody 
conjugated to the kinase polypeptide. Diagnostic kits for performing such methods may 

1 5 be constructed to include a first container containing the antibody and a second container 

having a conjugate of a binding partner of the antibody and a label, such as, for example, a 
radioisotope. The diagnostic kit may also include notification of an FDA approved use 
and instructions therefor. 

In a sixth aspect, the invention features a hybridoma which produces an antibody 

20 having specific binding affinity to a kinase polypeptide or a kinase polypeptide domain, 
where the polypeptide is selected from the group consisting of SEQ ID MO: 122, SEQ ID 
NO:123, SEQ IDNO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID NO:127, SEQ ID 
NO:128, SEQ ID NO: 129, SEQ ID NO: 130, SEQ ID NO:131, SEQ ID NO: 132, SEQ ID 
NO: 133, SEQ ID NO: 1 34, SEQ ID NO: 1 35, SEQ ID NO: 1 36, SEQ ID NO: 1 37, SEQ ID 

25 NO: 13 8, SEQ ID NO: 139, SEQ ID NO: 140, SEQ ID NO: 141, SEQ ID NO: 142, SEQ ID 
NO: 143, SEQ ID NO: 144, SEQ ID NO: 145, SEQ ID NO: 146, SEQ ID NO: 147, SEQ ID 
NO: 1 48, SEQ ID NO: 1 49, SEQ ID NO: 1 50, SEQ ID NO: 1 5 1 , SEQ ID NO: 1 52, SEQ ID 
NO:153, SEQ ID NO: 154, SEQ ID NO: 155, SEQ ID NO: 156, SEQ ID NO: 157, SEQ ID 
NO: 1 58, SEQ ID NO: 159, SEQ ID NO: 160, SEQ ID NO: 161, SEQ ID NO: 162, SEQ ID 

30 NO: 163, SEQ ID NO: 164, SEQ ID NO: 165. SEQ ID NO: 166, SEQ ID NO: 167, SEQ ID 
NO:168, SEQ ID NO: 169, SEQ ID NO: 170, SEQ ID NO: 171, SEQ ID NO: 172, SEQ ID 
NO:173, SEQ IDNO:174, SEQ ID NO:175, SEQ ID NO:176, SEQ ID NO:177, SEQ ID 
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NO: 178, SEQ ID NO: 179, SEQ ID NO: 180, SEQ ID NO:181, SEQ ID NO: 182, SEQ ID 
NO: 183, SEQ ID NO: 184, SEQ ID NO: 185, SEQ ID NO: 186, SEQ ID NO: 187, SEQ ID 
NO:l 88, SEQ ID NO:l 89, SEQ ID NO: 1 90, SEQ ID NO: 1 91 , SEQ ID NO: 1 99, SEQ ID 
NO:193, SEQ ID NO: 194, SEQ ED NO:195, SEQ ID NO:196, SEQ ID NO:197, SEQ ID 
NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID 
NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID 
NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:21 1, SEQ ID NO:212, SEQ ID 
NO:213, SEQ ID NO:214, SEQ ED NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID 
NO:218, SEQ IDNO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID 
NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID 
NO:228, SEQ IDNO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID 
NO:233, SEQ ED NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID 
NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241, and SEQ ID NO:242; and 
where the domains are defined as above. By "hybridoma" is meant an immortalized cell 
line that is capable of secreting an antibody, for example an antibody to a kinase of the 
invention. In preferred embodiments, the antibody to the kinase comprises a sequence of 
amino acids that is able to specifically bind a kinase polypeptide of the invention. 

In a seventh aspect, the invention features a kinase polypeptide binding agent able 
to bind to a kinase polypeptide selected from the group consisting of SEQ ID NO: 122, 
SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ED NO:127, 
SEQ ID NO: 128, SEQ ID NO: 129, SEQ ID NO: 130, SEQ ID NO: 131, SEQ ID NO: 132, 
SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID NO:137, 
SEQ ID NO: 138, SEQ ID NO: 139, SEQ ID NO: 140, SEQ ID NO:141, SEQ ID NO: 142, 
SEQ ID NO: 143, SEQ ID NO: 144, SEQ ED NO: 145, SEQ ID NO: 146, SEQ ID NO: 147, 
SEQ ID NO: 148, SEQ ID NO: 149, SEQ ID NO: 1 50, SEQ ID NO: 1 5 1 , SEQ ID NO:l 52, 
SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156, SEQ ID NO:157, 
SEQ ID NO: 158, SEQ ID NO: 159, SEQ ID NO: 160, SEQ ID NO: 161, SEQ ID NO: 162, 
SEQ ID NO: 1 63, SEQ ID NO: 1 64, SEQ ID NO: 1 65 . SEQ ID NO: 1 66, SEQ ID NO: 1 67, 
SEQ ID NO: 1 68, SEQ ID NO: 1 69, SEQ ID NO: 1 70, SEQ ID NO: 1 7 1 , SEQ ID NO: 1 72, 
SEQ ID NO: 173, SEQ ID NO: 174, SEQ ID NO: 175, SEQ ID NO: 176, SEQ ID NO: 177, 
SEQ ID NO: 178, SEQ ID NO: 179, SEQ ID NO: 180, SEQ ED NO:181, SEQ ID NO: 182, 
SEQ ID NO:l 83, SEQ ID NO:l 84, SEQ ID NO: 185, SEQ ID NO:l 86, SEQ ID NO:l 87, 
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SEQ ID NO: 1 88, SEQ ID NO: 1 89, SEQ ID NO: 1 90, SEQ ID NO: 1 9 1 , SEQ ID NO: 1 99, 
SEQ ID NO: 193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO: 196, SEQ ID NO: 197, 
SEQ ID NO: 198, SEQ ID NO: 199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, 
SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207, 
5 SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:21 1, SEQ ID NO:212, 
SEQ ID NO:2 1 3, SEQ ID NO:2 1 4, SEQ ID NO:2 ] 5, SEQ ID NO:2 1 6, SEQ ID NO:2 1 7, 
SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221 , SEQ ID NO:222, 
SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, 
SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, 

10 SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237, 

SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241, and SEQ ID 
NO:242. The binding agent is preferably a purified antibody that recognizes an epitope 
present on a kinase polypeptide of the invention. Other binding agents include molecules 
that bind to kinase polypeptides and analogous molecules that bind to a kinase 

15 polypeptide. Such binding agents may be identified by using assays that measure kinase 

binding partner activity, such as those that measure PDGFR activity. 

The invention also features a method for screening for human cells containing a 
kinase polypeptide of the invention or an equivalent sequence. The method involves 
identifying the novel polypeptide in human cells using techniques that are routine and 

20 standard in the art, such as those described herein for identifying the kinases of the 

invention (e.g., cloning, Southern or Northern blot analysis, in situ hybridization, PCR 
amplification, etc.). 

In an eighth aspect, the invention features methods for identifying a substance that 
modulates kinase activity comprising the steps of: (a) contacting a kinase polypeptide 

25 selected from the group consisting of SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, 
SEQ ID NO: 1 25, SEQ ID NO: 1 26, SEQ ID NO: 1 27, SEQ ID NO: 1 28, SEQ ID NO: 1 29, 
SEQ ID NO:130, SEQ ID NO:131, SEQ IDNO:132, SEQ ID NO:133, SEQ ID NO:134, 
SEQ ID NO: 135, SEQ ID NO: 136, SEQ ID NO: 137, SEQ ID NO: 138, SEQ ID NO: 139, 
SEQ ED NO: 140, SEQ ID NO: 141, SEQ ID NO: 142, SEQ ID NO: 143, SEQ ID NO: 144, 

30 SEQ ID NO: 145, SEQ ID NO: 146, SEQ ID NO: 147, SEQ ID NO: 148, SEQ ID NO: 149, 
SEQ ID NO: 1 50, SEQ ID NO: 1 5 1 , SEQ ID NO: 1 52, SEQ ID NO: 1 53, SEQ ID NO: 1 54, 
SEQ ED NO:155, SEQ ED NO:156, SEQ ED NO:157, SEQ ID NO:158, SEQ ID NO:159, 
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SEQ ID NO: 160, SEQ ID NO: 161, SEQ ID NO: 162, SEQ ID NO: 163, SEQ ID NO: 164, 
SEQ ID NO: 1 65. SEQ ID NO: 1 66, SEQ ID NO: 1 67, SEQ ID NO: 1 68, SEQ ID NO: 1 69, 
SEQ ID NO: 1 70, SEQ ID NO: 1 7 1 , SEQ ID NO: 1 72, SEQ ID NO: 1 73, SEQ ID NO: 1 74, 
SEQ ID NO: 1 75, SEQ ID NO: 1 76, SEQ ID NO: 1 77, SEQ ID NO: 1 78, SEQ ID NO: 1 79, 

5 SEQ ID NO: 1 80, SEQ ED NO: 1 8 1 , SEQ ID NO: 1 82, SEQ ID NO: 1 83, SEQ ID NO: 1 84, 
SEQ ID NO: 1 85, SEQ ID NO: 1 86, SEQ ID NO: 1 87, SEQ ID NO: 1 88, SEQ ID NO: 1 89, 
SEQ ID NO:190, SEQ ID NO:191, SEQ ID N0.199, SEQ ID NO:193, SEQ ID NO:194, 
SEQ ID NO: 195, SEQ ID NO:196, SEQ ID NO: 197, SEQ ID NO: 198, SEQ ID NO: 199, 
SEQ ID NO:200, SEQ ID NO.201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, 

10 SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, 
SEQ ID NO:210, SEQ ID NO:21 1, SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, 
SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, 
SEQ ID NO:220, SEQ ED NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, 
SEQ ID NO:225, SEQ ID NO:226, SEQ ED NO:227, SEQ ID NO:228, SEQ ED NO:229, 

15 SEQ ID NO:230, SEQ ID NO:23 1, SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, 
SEQ ID N0.235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, 
SEQ ED NO:240, SEQ ED NO:241, and SEQ ID NO:242 with a test substance; (b) 
measuring the activity of said polypeptide; and (c) determining whether said substance 
modulates the activity of said polypeptide. 

20 The term "modulates" refers to the ability of a compound to alter the function of a 

kinase of the invention. A modulator preferably activates or inhibits the activity of a 
kinase of the invention. 

The term "activates" refers to increasing the cellular activity of the kinase. The 
term inhibit refers to decreasing the cellular activity of the kinase. Kinase activity is 

25 preferably the interaction with a natural binding partner. 

The term "modulates" also refers to altering the function of kinases of the 
invention by increasing or decreasing the probability that a complex forms between the 
kinase and a natural binding partner. A modulator preferably increases the probability that 
such a complex forms between the kinase and the natural binding partner, more preferably 

30 increases or decreases the probability that a complex forms between the kinase and the 

natural binding partner depending on the concentration of the compound exposed to the 
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kinase, and most preferably decreases the probability that a complex forms between the 
kinase and the natural binding partner. 

The term "complex" refers to an assembly of at least two molecules bound to one 
another. Signal transduction complexes often contain at least two protein molecules 
5 bound to one another. For instance, a protein tyrosine receptor protein kinase, GRB2, 

SOS, RAF, and RAS assemble to form a signal transduction complex in response to a 
mitogenic ligand. 

The term "natural binding partner" refers to polypeptides, lipids, small molecules, 
or nucleic acids that bind to kinases in cells. A change in the interaction between a kinase 

10 and a natural binding partner can manifest itself as an increased or decreased probability 

that the interaction forms, or an increased or decreased concentration of kinase/natural 
binding partner complex. 

The term "contacting" as used herein refers to mixing a solution comprising the 
test compound with a liquid medium bathing the cells of the methods. The solution 

15 comprising the compound may also comprise another component, such as dimethyl 

sulfoxide (DMSO), which facilitates the uptake of the test compound or compounds into 
the cells of the methods. The solution comprising the test compound may be added to the 
medium bathing the cells by utilizing a delivery apparatus, such as a pipet-based device or 
syringe-based device. 

20 In a ninth aspect, the invention features methods for identifying a substance that 

modulates kinase activity in a cell comprising the steps of: (a) expressing a kinase 
polypeptide in a cell, wherein said polypeptide is selected from the group consisting of 
SEQ ID NO: 1 22, SEQ ID NO: 1 23, SEQ ID NO: 1 24, SEQ ED NO: 1 25, SEQ ID NO: 1 26, 
SEQ ID NO: 1 27, SEQ ID NO: 1 28, SEQ ED NO: 1 29, SEQ ID NO: 1 30, SEQ ID NO: 1 3 1 , 

25 SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO: 134, SEQ ID NO:135, SEQ ID NO:136, 

SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141 , 
SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144 5 SEQ ID NO:145, SEQ ID NO:146, 
SEQ ID NO: 1 47, SEQ ID NO: 1 48, SEQ ID NO: 1 49, SEQ ID NO: 1 50, SEQ ID NO: 1 5 1 , 
SEQ ID NO: 152, SEQ ID NO: 153, SEQ ID NO: 154, SEQ ID NO: 155, SEQ ID NO:l 56, 

30 SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161, 
SEQ ID NO:162, SEQ ID NO:163 5 SEQ ID NO:164, SEQ ID NO:165. SEQ ID NO: 166, 
SEQ ID NO: 167, SEQ ID NO: 168, SEQ ID NO: 169, SEQ ID NO: 170, SEQ ID NO: 171, 
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15 and SEQ ID NO:242; (b) adding a test substance to said cell; and (c) monitoring a change 

in cell phenotype or the interaction between said polypeptide and a natural binding 
partner. 

The term "expressing" as used herein refers to the production of kinases of the 
invention from a nucleic acid vector containing kinase genes within a cell. The nucleic 
20 acid vector is transfected into cells using well known techniques in the art as described 

herein. 

In a tenth aspect, the invention provides methods for treating a disease or abnormal 
condition by administering to a patient in need of such treatment a substance that 
modulates the activity of a polypeptide selected from the group consisting of SEQ ID 

25 NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID 
NO: 127, SEQ ID NO: 128, SEQ ID NO: 129, SEQ ID NO: 130, SEQ ID NO: 131, SEQ ID 
NO: 132, SEQ ID NO: 133, SEQ ID NO: 134, SEQ ID NO: 135, SEQ ID NO: 136, SEQ ID 
NO: 137, SEQ ID NO: 138, SEQ ID NO: 139, SEQ ID NO: 140, SEQ ID NO: 141, SEQ ID 
NO: 142, SEQ ID NO: 143, SEQ ID NO: 144, SEQ ID NO: 145, SEQ ID NO: 146, SEQ ID 

30 NO: 147, SEQ ID NO: 148, SEQ ID NO: 149, SEQ ID NO: 150, SEQ ID NO: 151, SEQ ID 
NO: 152, SEQ ID NO: 153, SEQ ID NO: 154, SEQ ID NO: 155, SEQ ID NO: 156, SEQ ID 
NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161, SEQ ID 
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NO: 162, SEQ ID NO: 163, SEQ ID NO: 164, SEQ ID NO: 165. SEQ ID NO: 166, SEQ ID 
NO: 1 67, SEQ ID NO: 1 68, SEQ ID NO: 1 69, SEQ ED NO: 1 70, SEQ ID NO: 1 7 1 , SEQ ID 
NO: 1 72, SEQ ID NO: 1 73, SEQ ID NO: 1 74, SEQ ID NO: 1 75, SEQ ED NO: 1 76, SEQ ID 
NO: 1 77, SEQ ID NO: 1 78, SEQ ID NO: 1 79, SEQ ID NO: 1 80, SEQ ED NO: 1 81 , SEQ ID 
5 NO: 1 82, SEQ ID NO: 1 83, SEQ ID NO: 1 84, SEQ ID NO: 1 85, SEQ ID NO: 1 86, SEQ ID 
NO: 187, SEQ ID NO: 188, SEQ ID NO: 189, SEQ ID NO: 190, SEQ ID NO: 191, SEQ ID 
NO:199, SEQ ID NO: 193, SEQ ID NO: 194, SEQ ID NO: 195, SEQ ID NO: 196, SEQ ID 
NO: 197, SEQ ID NO: 198, SEQ ID NO: 199, SEQ ID NO:200, SEQ ID NO:201 , SEQ ID 
NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID 

10 NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:21 0, SEQ ID NO:21 1 , SEQ ID 
NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID 
NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID 
NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID 
NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ED NO:231 , SEQ ID 

1 5 NO:232, SEQ ED NO:233, SEQ ED NO:234, SEQ ED NO:235, SEQ ED NO:236, SEQ ID 
NO:237, SEQ ED NO:238, SEQ ED NO:239, SEQ ED NO:240, SEQ ED NO:241 , and SEQ 
ED NO:242. Preferably, the disease is selected from the group consisting of immune- 
related diseases and disorders, cardiovascular disease, neurodegenerative disorders, and 
cancer. Also included are metabolic disorders, such as diabetes mellitus, and reproductive 

20 disorders, such as infertility. 

Preferably, the disease or disorder is selected from the group consisting of 
rheumatoid arthritis, artherosclerosis, autoimmune disorders, and organ transplantation. 
Preferably the disease or disorder is selected from the group consisting of immune-related 
diseases and disorders, myocardial infarction, cardiomyopathies, stroke, renal failure, and 

25 oxidative stress-related neurodegenerative disorders. Most preferably, the immune-related 
diseases and disorders are selected from the group consisting of rheumatoid arthritis, 
chronic inflammatory bowel disease, chronic inflammatory pelvic disease, multiple 
sclerosis, asthma, osteoarthritis, psoriasis, atherosclerosis, rhinitis, autoimmunity, and 
organ transplantation. 

30 Substances useful for treatment of disorders or diseases preferably show positive 

results in one or more in vitro assays for an activity corresponding to treatment of the 
disease or disorder in question Substances that modulate the activity of the polypeptides 
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preferably include, but are not limited to, antisense oligonucleotides and inhibitors of 
protein kinases. 

The term "preventing" refers to decreasing the probability that an organism 
contracts or develops an abnormal condition. 
5 The term "treating" refers to having a therapeutic effect and at least partially 

alleviating or abrogating an abnormal condition in the organism. 

The term "therapeutic effect" refers to the inhibition or activation factors causing 
or contributing to the abnormal condition. A therapeutic effect relieves to some extent one 
or more of the symptoms of the abnormal condition. In reference to the treatment of 
10 abnormal conditions, a therapeutic effect can refer to one or more of the following: (a) an 
increase in the proliferation, growth, and/or differentiation of cells; (b) inhibition (i.e., 
slowing or stopping) of cell death; (c) inhibition of degeneration; (d) relieving to some 
extent one or more of the symptoms associated with the abnormal condition; and (e) 
enhancing the function of the affected population of cells. Compounds demonstrating 
15 efficacy against abnormal conditions can be identified as described herein. 

The term "abnormal condition" refers to a function in the cells or tissues of an 
organism that deviates from their normal functions in that organism. An abnormal 
condition can relate to cell proliferation, cell differentiation or cell survival. An abnormal 
condition may also include irregularities in cell cycle progression, i.e., irregularities in 
20 normal cell cycle progression through mitosis and meiosis. 

Abnormal cell proliferative conditions include cancers such as fibrotic and 
mesangial disorders, abnormal angiogenesis and vasculogenesis, wound healing, psoriasis, 
diabetes mellitus, and inflammation. 

Abnormal differentiation conditions include, but are not limited to 
25 neurodegenerative disorders, slow wound healing rates, and slow tissue grafting healing 

rates. 

Abnormal cell survival conditions relate to conditions in which programmed cell 
death (apoptosis) pathways are activated or abrogated. A number of protein kinases are 
associated with the apoptosis pathways. Aberrations in the function of any one of the 
30 protein kinases could lead to cell immortality or premature cell death. 



WO 00/73469 



62 



PCTAJSOO/14842 



The term "aberration", in conjunction with the function of a kinase in a signal 
transduction process, refers to a kinase that is over- or under-expressed in an organism, 
mutated such that its catalytic activity is lower or higher than wild-type protein kinase 
activity, mutated such that it can no longer interact with a natural binding partner, is no 
5 longer modified by another protein kinase or protein phosphatase, or no longer interacts 

with a natural binding partner. 

The term "administering" relates to a method of incorporating a compound into 
cells or tissues of an organism. The abnormal condition can be prevented or treated when 
the cells or tissues of the organism exist within the organism or outside of the organism. 

10 Cells existing outside the organism can be maintained or grown in cell culture dishes. For 

cells harbored within the organism, many techniques exist in the art to administer 
compounds, including (but not limited to) oral, parenteral, dermal, injection, and aerosol 
applications. For cells outside of the organism, multiple techniques exist in the art to 
administer the compounds, including (but not limited to) cell microinjection techniques, 

1 5 transformation techniques, and carrier techniques. 

The abnormal condition can also be prevented or treated by administering a 
compound to a group of cells having an aberration in a signal transduction pathway to an 
organism. The effect of administering a compound on organism function can then be 
monitored. The organism is preferably a mouse, rat, rabbit, guinea pig, or goat, more 

20 preferably a monkey or ape, and most preferably a human. 

In an eleventh aspect, the invention features methods for detection the expression 
of a polypeptide in a sample as a diagnostic tool for diseases or disorders, wherein the 
method comprises the steps of: (a) contacting the sample with a nucleic acid probe which 
hybridizes under hybridization assay conditions to a nucleic acid target region of a kinase 

25 polypeptide selected from the group consisting of SEQ ID NO:122, SEQ ED NO:123, SEQ 

ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID NO:127, SEQ ID NO:128, SEQ 
ID NO: 129, SEQ ID NO: 130, SEQ ID NO: 131, SEQ ID NO: 132, SEQ ID NO: 133, SEQ 
ID NO: 134, SEQ ED NO: 135, SEQ ED NO: 136, SEQ ID NO: 137, SEQ ID NO: 138, SEQ 
ID NO:139, SEQ ID NO:140, SEQ ID NO:141, SEQ ID NO:142, SEQ ID NO:143, SEQ 

30 ID NO: 1 44, SEQ ID NO: 1 45, SEQ ID NO: 1 46, SEQ ID NO: 1 47, SEQ ID NO: 148, SEQ 
ID NO: 1 49, SEQ ID NO: 1 50, SEQ ID NO: 1 5 1 , SEQ ID NO: 1 52, SEQ ID NO: 1 53, SEQ 
ID NO:154, SEQ ID NO:155, SEQ ID NO:156, SEQ ID NO:157, SEQ ID NO:158, SEQ 
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ID NO: 1 59, SEQ ID NO: 1 60, SEQ ID NO: 1 61 , SEQ ID NO: 1 62, SEQ ID NO: 1 63, SEQ 
ID NO: 1 64, SEQ ID NO: 1 65 . SEQ ID NO: 1 66, SEQ ID NO: 1 67, SEQ ID NO: 1 68, SEQ 
ID NO: 1 69, SEQ ED NO: 1 70, SEQ ID NO: 171, SEQ ID NO: 1 72, SEQ ID NO: 1 73, SEQ 
ID NO: 1 74, SEQ ID NO: 1 75 , SEQ ID NO: 1 76, SEQ ID NO: 1 77, SEQ ID NO: 1 78, SEQ 

5 ID NO: 1 79, SEQ ID NO: 1 80, SEQ ID NO: 1 8 1 , SEQ ID NO: 1 82, SEQ ID NO: 1 83 , SEQ 
ID NO: 184, SEQ ID NO: 185, SEQ ID NO: 186, SEQ ID NO: 187, SEQ ID NO: 188, SEQ 
ED NO: 1 89, SEQ ID NO: 1 90, SEQ ID NO: 1 91 , SEQ ED NO: 1 99, SEQ ID NO: 1 93, SEQ 
ED NO: 194, SEQ ED NO: 195, SEQ ED NO: 196, SEQ ID NO: 197, SEQ ID NO: 198, SEQ 
ID NO:199, SEQ ED NO:200, SEQ ED NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ 

10 ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ 

ID NO:209, SEQ ED NO:210, SEQ ID NO:21 1, SEQ ID NO:212, SEQ ID NO:213, SEQ 
ID NO:214, SEQ ID NO:215, SEQ ED NO:216, SEQ ED NO:217, SEQ ED NO:218, SEQ 
ID NO:219, SEQ ID NO:220, SEQ ID NO:221 , SEQ ID NO:222, SEQ ID NO:223, SEQ 
ID NO:224, SEQ ED NO:225, SEQ ED NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ 

15 ED NO:229, SEQ ED NO:230, SEQ ED NO:231, SEQ ID NO:232, SEQ ID NO:233, SEQ 

ED NO:234, SEQ ED NO:235, SEQ ED NO:236, SEQ ED NO:237, SEQ ID NO:238, SEQ 
ED NO:239, SEQ ED NO:240, SEQ ED NO:241 , and SEQ ED NO:242, said probe 
comprising the nucleic acid sequence encoding the polypeptide, fragments thereof, and the 
complements of the sequences and fragments; and (b) detecting the presence or amount of 

20 the probe:target region hybrid as an indication of the disease. 

In preferred embodiments of the invention, the disease or disorder is selected from 
the group consisting of rheumatoid arthritis, artherosclerosis, autoimmune disorders, organ 
transplantation, myocardial infarction, cardiomyopathies, stroke, renal failure, oxidative 
stress-related neurodegenerative disorders, metabolic disorder including diabetes, 

25 reproductive disorders including infertility, and cancer. 

The kinase "target region" is a nucleotide base sequence selected from the group 
consisting of those set forth in SEQ ID NO: 1, SEQ ID NO:2, SEQ ID NO:3, SEQ ID 
NO:4, SEQ ED NO:5, SEQ ED NO:6, SEQ ED NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ 
ID NO: 10, SEQ ED NO: 1 1, SEQ ED NO: 12, SEQ ED NO: 13, SEQ ED NO: 14, SEQ ID 

30 NO: 1 5, SEQ ED NO: 1 6, SEQ ED NO: 1 7, SEQ ID NO: 1 8, SEQ ED NO: 1 9, SEQ ED NO:20, 
SEQ ED NO:21, SEQ ID NO:22, SEQ ED NO:23, SEQ ED NO:24, SEQ ED NO:25, SEQ 
ED NO:26, SEQ ED NO:27, SEQ ED NO:28, SEQ ED NO:29, SEQ ID NO:30, SEQ ED 
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NO:31, SEQ ID NO:32, SEQ ID NO:33, SEQ ID NO:34, SEQ ID NO:35, SEQ ID NO:36, 
SEQ ED NO:37, SEQ ID NO:38, SEQ ID NO:39, SEQ ID NO:40, SEQ ID N0:41, SEQ 
ID NO:42, SEQ ID NO:43, SEQ ID NO:44, SEQ ID NO:45, SEQ ID NO:46, SEQ ID 
NO:47, SEQ ID NO:48, SEQ ID NO:49, SEQ ID NO:50, SEQ ID N0:51 , SEQ ID NO:52, 
5 SEQ ID NO:53, SEQ ID NO:54, SEQ ID NO:55, SEQ ID NO:56, SEQ ID NO:57, SEQ 
ID NO:58, SEQ ID NO:59, SEQ ID NO:60, SEQ ID N0:61, SEQ ID NO:62, SEQ ID 
NO:63, SEQ ID NO:64, SEQ ID NO:65, SEQ ID NO:66, SEQ ID NO:67, SEQ ID NO:68, 
SEQ ID NO:69, SEQ ID NO:70, SEQ ID N0:71 , SEQ ED NO:72, SEQ ED NO:73, SEQ 
ED NO:74, SEQ ED NO:75, SEQ ED NO:76, SEQ ED NO:77, SEQ ED NO:78, SEQ ED 

10 NO:79, SEQ ID NO:80, SEQ ED N0:81 , SEQ ED NO:82, SEQ ID NO:83, SEQ ED NO:84, 

SEQ ED NO:85, SEQ ED NO:86, SEQ ED NO:87, SEQ ID NO:88, SEQ ID NO:89, SEQ 
ID NO:90, SEQ ED N0:91 , SEQ ED NO:92, SEQ ED NO:93, SEQ ID NO:94, SEQ ED 
NO:95, SEQ ED NO:96, SEQ ID NO:97, SEQ ID NO:98, SEQ ID NO:99, SEQ ID 
NO: 1 00, SEQ ED NO: 101, SEQ ED NO: 102, SEQ ID NO: 1 03, SEQ ID NO: 1 04, SEQ ID 

15 NO: 1 05, SEQ ID NO: 1 06, SEQ ID NO: 1 07, SEQ ID NO: 1 08, SEQ ID NO: 1 09, SEQ ID 

NO: 1 1 0, SEQ ID NO: 1 1 1 , SEQ ID NO: 1 1 2, SEQ ID NO: 1 1 3, SEQ ID NO: 1 1 4, SEQ ID 
NO: 1 1 5, SEQ ID NO: 1 1 6, SEQ ID NO: 1 1 7, SEQ ID NO: 1 1 8, SEQ ID NO: 1 1 9, SEQ ID 
NO: 120, and SEQ ED NO: 121 , or the corresponding full-length sequences, a functional 
derivative thereof, or a fragment thereof to which the nucleic acid probe will specifically 

20 hybridize. Specific hybridization indicates that in the presence of other nucleic acids the 
probe only hybridizes detectably with the kinase of the invention's target region. Putative 
target regions can be identified by methods well known in the art consisting of alignment 
and comparison of the most closely related sequences in the database. 

In preferred embodiments the nucleic acid probe hybridizes to a kinase target 

25 region encoding at least 6, 12, 75, 90, 105, 120, 150, 200, 250, 300 or 350 contiguous 
amino acids of the sequence set forth in SEQ ED NO: 122, SEQ ID NO: 123, SEQ ED 
NO: 124, SEQ ID NO: 125, SEQ ED NO: 126, SEQ ID NO: 127, SEQ ID NO: 128, SEQ ID 
NO: 129, SEQ ID NO: 130, SEQ ID NO: 131, SEQ ID NO: 132, SEQ ID NO: 133, SEQ ID 
NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID NO:137, SEQ ID NO:138, SEQ ID 

30 NO:139, SEQ ED NO:140, SEQ ID NO:141, SEQ ED NO:142, SEQ ED NO:143, SEQ ID 
NO: 144, SEQ ID NO: 145, SEQ ID NO: 146, SEQ ID NO: 147, SEQ ID NO: 148, SEQ ID 
NO: 149, SEQ ED NO: 1 50, SEQ ED NO: 1 5 1 , SEQ ED NO: 1 52, SEQ ED NO: 1 53, SEQ ID 
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NO: 154, SEQ ID NO: 155, SEQ ID NO: 156, SEQ ID NO: 157, SEQ ID NO: 158, SEQ ID 
NO: 159, SEQ ID NO: 160, SEQ ID NO: 161, SEQ ID NO: 162, SEQ ID NO: 163, SEQ ID 
NO:164, SEQ ID NO:165. SEQ ID NO:166, SEQ ID NO:167, SEQ ID NO:168, SEQ ID 
NO: 169, SEQ ID NO: 170, SEQ ID NO:171, SEQ ID NO: 172, SEQ ID NO: 173, SEQ ID 

5 NO: 174, SEQ ID NO: 175, SEQ ID NO: 176, SEQ ID NO: 177, SEQ ID NO: 178, SEQ ID 
NO: 179, SEQ ID NO: 180, SEQ ID NO: 181, SEQ ID NO: 182, SEQ ID NO: 183, SEQ ID 
NO:184, SEQ ID NO:185, SEQ ID NO:186, SEQ ID NO:l 87, SEQ ID NO:188, SEQ ID 
NO: 189, SEQ ID NO: 190, SEQ ID NO: 191, SEQ ID NO: 199, SEQ ID NO: 193, SEQ ID 
NO: 194, SEQ ID NO: 195, SEQ ID NO: 196, SEQ ID NO: 197, SEQ ID NO: 198, SEQ ID 

10 NO:199, SEQ ED NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID 
NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID 
NO:209, SEQ ID NO:210, SEQ ID NO:21 1, SEQ ID NO:212, SEQ ID NO:213, SEQ ID 
NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID NO:218, SEQ ID 
NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ ID 

1 5 NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID 
NO:229, SEQ ED NO:230, SEQ ED NO:23 1 , SEQ ED N0.232, SEQ ED NO:233, SEQ ID 
NO:234, SEQ ED NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID 
NO:239, SEQ ID NO:240, SEQ ID NO:241 , and SEQ ID NO:242, or the corresponding 
full-length amino acid sequence, or a functional derivative thereof. Hybridization 

20 conditions should be such that hybridization occurs only with the kinase genes in the 

presence of other nucleic acid molecules. Under stringent hybridization conditions only 
highly complementary nucleic acid sequences hybridize. Preferably, such conditions 
prevent hybridization of nucleic acids having more than 1 or 2 mismatches out of 20 
contiguous nucleotides. Such conditions are defined supra. 

25 Hybridization conditions should be such that hybridization occurs only with the 

genes in the presence of other nucleic acid molecules. Under stringent hybridization 
conditions only highly complementary nucleic acid sequences hybridize. Preferably, such 
conditions prevent hybridization of nucleic acids having 1 or 2 mismatches out of 20 
contiguous nucleotides. Such conditions are defined supra. 

30 The diseases for which detection of kinase genes in a sample could be diagnostic 

include diseases in which kinase nucleic acid (DNA and/or RNA) is amplified in 
comparison to normal cells. By "amplification" is meant increased numbers of kinase 
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DNA or RNA in a cell compared with normal cells. In normal cells, kinases are typically 
found as single copy genes. In selected diseases, the chromosomal location of the kinase 
genes may be amplified, resulting in multiple copies of the gene, or amplification. Gene 
amplification can lead to amplification of kinase RNA, or kinase RNA can be amplified in 

5 the absence of kinase DNA amplification. 

"Amplification" as it refers to RNA can be the detectable presence of kinase RNA 
in cells, since in some normal cells there is no basal expression of kinase RNA. In other 
normal cells, a basal level of expression of kinase exists, therefore in these cases 
amplification is the detection of at least 1-2-fold, and preferably more, kinase RNA, 

10 compared to the basal level. 

The diseases that could be diagnosed by detection of kinase nucleic acid in a 
sample preferably include cancers. The test samples suitable for nucleic acid probing 
methods of the present invention include, for example, cells or nucleic acid extracts of 
cells, or biological fluids. The samples used in the above-described methods will vary 

15 based on the assay format, the detection method and the nature of the tissues, cells or 

extracts to be assayed. Methods for preparing nucleic acid extracts of cells are well 
known in the art and can be readily adapted in order to obtain a sample that is compatible 
with the method utilized. 

Another aspect of the invention involves a method of agonizing (stimulating) or 

20 antagonizing a target of the invention and a natural binding partner associated activity in a 

mammal comprising administering to said mammal an agonist or antagonist to one of the 
above disclosed polypeptides in an amount sufficient to effect said agonism or 
antagonism. A method of treating diseases in a mammal with an agonist or antagonist of 
the protein of the present invention activity comprising administering the agonist or 

25 antagonist to a mammal in an amount sufficient to agonize or antagonize associated 

functions is also encompassed in the present application. 

In an effort to discover novel treatments for diseases, biomedical researchers and 
chemists have designed, synthesized, and tested molecules that inhibit the function of 
protein polypeptides. Some small organic molecules form a class of compounds that 

30 modulate the function of protein polypeptides. Examples of molecules that have been 
reported to inhibit the function of protein kinases include, but are not limited to, bis 
monocyclic, bicyclic or heterocyclic aryl compounds (PCT WO 92/20642, published 
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November 26, 1992 by Maguire et al), vinylene-azaindole derivatives (PCT WO 
94/14808, published July 7, 1994 by Ballinari et a/.), l-cyclopropyl-4-pyridyl-quinolones 
(U.S. Patent No. 5,330,992), styryl compounds (U.S. Patent No. 5,217,999), styryl- 
substituted pyridyl compounds (U.S. Patent No. 5,302,606), certain quinazoline 
5 derivatives (EP Application No. 0 566 266 Al), seleoindoles and selenides (PCT WO 
94/03427, published February 17, 1994 by Denny et al.), tricyclic polyhydroxylic 
compounds (PCT WO 92/21660, published December 10, 1992 by Dow), and 
benzylphosphonic acid compounds (PCT WO 91/15495, published October 17, 1991 by 
Dow et al), all of which are incorporated by reference herein, including any drawings. 

1 o Compounds that can traverse cell membranes and are resistant to acid hydrolysis 

are potentially advantageous as therapeutics as they can become highly bioavailable after 
being administered orally to patients. However, many of these protein inhibitors only 
weakly inhibit function. In addition, many inhibit a variety of protein kinases and will 
therefore cause multiple side-effects as therapeutics for diseases. 

15 Some indolinone compounds, however, form classes of acid resistant and 

membrane permeable organic molecules. WO 96/22976 (published August 1, 1996 by 
Ballinari et al) describes hydrosoluble indolinone compounds that harbor tetralin, 
naphthalene, quinoline, and indole substituents fused to the oxindole ring. These bicyclic 
substituents are in turn substituted with polar groups including hydroxylated alky], 

20 phosphate, and ether substituents. U.S. Patent Application Serial Nos. 08/702,232, filed 
August 23, 1996, entitled 'Indolinone Combinatorial Libraries and Related Products and 
Methods for the Treatment of Disease" by Tang et al. (Lyon & Lyon Docket No. 
221/187) and 08/485,323, filed June 7, 1995, entitled "Benzylidene-Z-Indoline 
Compounds for the Treatment of Disease" by Tang et al (Lyon & Lyon Docket No. 

25 223/298) and International Patent Publication WO 96/22976, published August 1 , 1 996 by 

Ballinari et al, all of which are incorporated herein by reference in their entirety, 
including any drawings, describe indolinone chemical libraries of indolinone compounds 
harboring other bicyclic moieties as well as monocyclic moieties fused to the oxindole 
ring. Applications 08/702,232, filed August 23, 1996, entitled "Indolinone Combinatorial 

30 Libraries and Related Products and Methods for the Treatment of Disease" by Tang et al 
(Lyon & Lyon Docket No. 221/187), 08/485,323, filed June 7, 1995, entitled 
"Benzylidene-Z-Indoline Compounds for the Treatment of Disease" by Tang ei al (Lyon 
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& Lyon Docket No. 223/298), and WO 96/22976, published August 1, 1996 by Ballinari 
et al. teach methods of indolinone synthesis, methods of testing the biological activity of 
indolinone compounds in cells, and inhibition patterns of indolinone derivatives, both of 
which are incorporated by reference herein, including any drawings. 
5 Other examples of substances capable of modulating kinase activity include, but 

are not limited to, tyrphostins, quinazolines, quinoxolines, and quinolines. The 
quinazolines, tyrphostins, quinolines, and quinoxolines referred to above include well 
known compounds such as those described in the literature. For example, representative 
publications describing quinazolines include Barker et al., EPO Publication No. 0 520 722 

10 Al; Jones et al., U.S. Patent No. 4,447,608; Kabbe et al., U.S. Patent No. 4,757,072; Kaul 

and Vougioukas, U.S. Patent No. 5, 316,553; Kreighbaum and Comer, U.S. Patent No. 
4,343,940; Pegg and Wardleworth, EPO Publication No. 0 562 734 Al; Barker et al., Proc. 
of Am. Assoc. for Cancer Research 32:327 (1991); Bertino, J.R., Cancer Research 3:293- 
304 (1979); Bertino, J.R., Cancer Research 9(2 part l):293-304 (1979); Curtin et ah, Br. J. 

15 Cancer 53:361-368 (1986); Fernandes et al., Cancer Research 43:1 1 17-1 123 (1983); Ferris 

et al. J. Org. Chem. 44(2):173-178; Fry et al., Science 265:1093-1095 (1994); Jackman et 
al., Cancer Research 5 1 :5579-5586 (1981); Jones et al. J. Med. Chem. 29(6): 1114-1118; 
Lee and Skibo, Biochemistry 26(23):7355-7362 (1987); Lemus et al., J. Org. Chem. 
54:351 1-3518 (1989); Ley and Seng, Synthesis 3975:415-522 (1975); Maxwell et al., 

20 Magnetic Resonance in Medicine 17:189-196 (1991); Mini et al., Cancer Research 

45:325-330 (1985); Phillips and Castle, J. Heterocyclic Chem. 1 7(19):1489-1 596 (1980); 
Reece et al., Cancer Research 47(1 1):2996-2999 (1 977); Sculier et al., Cancer Immunol, 
and lmmunother. 23:A65 (1986); Sikora et al., Cancer Letters 23:289-295 (1984); and 
Sikora et al., Analytical Biochem. 172:344-355 (1988), all of which are incorporated 

25 herein by reference in their entirety, including any drawings. 

Quinoxaline is described in Kaul and Vougioukas, U.S. Patent No. 5,316,553, 
incorporated herein by reference in its entirety, including any drawings. 

Quinolines are described in Dolle et al., J. Med. Chem. 37:2627-2629 (1994); 
MaGuire, J. Med. Chem. 37:2129-2131 (1994); Burke et al, J. Med. Chem. 36:425-432 

30 (1993); and Burke et al. BioOrgamc Med. Chem. Letters 2:1771-1774 (1992), all of which 

are incorporated by reference in their entirety, including any drawings. 
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Tyrphostins are described in Allen et al., Clin. Exp. Immunol. 91:141-156 (1993); 
Anafi et al., Blood 82:12:3524-3529 (1993); Baker et al., J. Cell Sci. 102:543-555 (1992); 
Bilder et al Amer. Phvsiol. Soc. pp. 6363-61 43:C721-C730 (1991); Brunton et al., 
Proceedings of Amer. Assoc. Cancer Rsch. 33:558 (1992); Bryckaert et al., Experimental 

5 Cell Research 199:255-261 (1992^: Done et al.. J. Leukocyte Biology 53:53-60 (1993); 
Dong et al., J. Immunol. 151(5):2717-2724 (1993); Gazit et al., J. Med. Chem. 32:2344- 
2352 (1989); Gazit et al., " J. Med. Chem. 36:3556-3564 (1993); Kaur et al., Anti-Cancer 
Drugs 5:213-222 (1994); Kaur et al., Kinget al.. Biochem. J. 275:413-418 (1991); Kuo et 
al., Cancer Letters 74:197-202 (1993); Levitzki, A., The FASEB J. 6:3275-3282 (1992); 

10 Lyall et al., J. Biol. Chem. 264:14503-14509 (1989); Peterson et al., The Prostate 22:335- 

345 (1993); Pillemer et al., Int. J. Cancer 50:80-85 (1992); Posner et al., Molecular 
Pharmacology 45:673-683 (1993); Rendu et al., Biol. Pharmacology 44(5):881-888 
(1992); Sauro and Thomas, Life Sciences 53:371-376 (1993); Sauro and Thomas, I 
Pharm. and Experimental Therapeutics 267(3): 1 19-1 125 (1993); Wolbring et al., LBjol 

15 Chem. 269(36):22470-22472 (1994); and Yoneda et al., Cancer Research 51:4430-4435 

(1991); all of which are incorporated herein by reference in their entirety, including any 
drawings. 

Other compounds that could be used as modulators include oxindolinones such as 
those described in U.S. patent application Serial No. 08/702,232 filed August 23, 1996, 

20 incorporated herein by reference in its entirety, including any drawings. 
Methods of Treating a Disease (Enablement - i.e.. Dosing*) 

Methods of determining the dosages of compounds to be administered to a patient 
and modes of administering compounds to an organism are disclosed in U.S. Application 
Serial No. 08/702,282, filed August 23, 1996 and International patent publication number 

25 WO 96/22976, published August 1 1 996, both of which are incorporated herein by 

reference in their entirety, including any drawings, figures or tables. Those skilled in the 
art will appreciate that such descriptions are applicable to the present invention and can be 
easily adapted to it. 

The proper dosage depends on various factors such as the type of disease being 
30 treated, the particular composition being used and the size and physiological condition of 
the patient. Therapeutically effective doses for the compounds described herein can be 
estimated initially from cell culture and animal models. For example, a dose can be 
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formulated in animal models to achieve a circulating concentration range that initially 
takes into account the IC50 as determined in cell culture assays. The animal model data 
can be used to more accurately determine useful doses in humans. 

Plasma half-life and biodistribution of the drug and metabolites in the plasma, 

5 tumors and major organs can also be determined to facilitate the selection of drugs most 

appropriate to inhibit a disorder. Such measurements can be carried out. For example, 
HPLC analysis can be performed on the plasma of animals treated with the drug and the 
location of radiolabeled compounds can be deter-mined using detection methods such as 
X-ray, CAT scan and MRI. Compounds that show potent inhibitory activity in the 

1 0 screening assays, but have poor pharmacokinetic characteristics, can be optimized by 

altering the chemical structure and retesting. In this regard, compounds displaying good 
pharmacokinetic characteristics can be used as a model. 

Toxicity studies can also be carried out by measuring the blood cell composition. 
For example, toxicity studies can be carried out in a suitable animal model as follows: 

15 1) the compound is administered to mice (an untreated control mouse should also be 

used); 2) blood samples are periodically obtained via the tail vein from one mouse in each 
treatment group; and 3) the samples are analyzed for red and white blood cell counts, 
blood cell composition and the percent of lymphocytes versus polymorphonuclear cells. A 
comparison of results for each dosing regime with the controls indicates if toxicity is 

20 present. 

At the termination of each toxicity study, further studies can be carried out by 
sacrificing the animals (preferably, in accordance with the American Veterinary Medical 
Association guidelines Report of the American Veterinary Medical Assoc. Panel on 
F.nthanasia. Journal of American Veterinary Medical Assoc. , 202:229-249, 1993). 

25 Representative animals from each treatment group can then be examined by gross 

necropsy for immediate evidence of metastasis, unusual illness or toxicity. Gross 
abnormalities in tissue are noted and tissues are examined histologically. Compounds 
causing a reduction in body weight or blood components are less preferred, as are 
compounds having an adverse effect on major organs. In general, the greater the adverse 

30 effect the less preferred the compound. 
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For the treatment of cancers the expected daily dose of a hydrophobic 
pharmaceutical agent is between 1 to 500 mg/day, preferably 1 to 250 mg/day, and most 
preferably 1 to 50 mg/day. Drugs can be delivered less frequently provided plasma levels 
of the active moiety are sufficient to maintain therapeutic effectiveness. 
5 Plasma levels should reflect the potency of the drug. Generally, the more potent 

the compound the lower the plasma levels necessary to achieve efficacy. 

In a final aspect, the invention features a method for detection of a kinase 
polypeptide in a sample as a diagnostic tool for a disease or disorder, wherein the method 
comprises: (a) comparing a nucleic acid target region encoding the kinase polypeptide in 

10 a sample, where the kinase polypeptide is selected from the group consisting of SEQ ID 

NO: 122, SEQ ID NO: 123, SEQ ID NO: 124, SEQ ID NO: 125, SEQ ID NO: 126, SEQ ID 
NO: 127, SEQ ID NO: 128, SEQ ID NO: 129, SEQ ID NO: 130, SEQ ID NO: 131, SEQ ID 
NO: 1 32, SEQ ID NO: 1 33, SEQ ID NO: 1 34, SEQ ID NO: 1 3 5, SEQ ID NO: 1 3 6, SEQ ID 
NO: 137, SEQ ID NO:138, SEQ ID NO: 139, SEQ ID NO: 140, SEQ ID NO: 141, SEQ ID 

15 NO: 142, SEQ ID NO: 143, SEQ ID NO: 144, SEQ ID NO: 145, SEQ ID NO: 146, SEQ ID 
NO: 147, SEQ ID NO: 148, SEQ ID NO: 149, SEQ ED NO: 150, SEQ ID NO:151, SEQ ID 
NO: 152, SEQ ID NO: 153, SEQ ED NO: 154, SEQ ED NO: 155, SEQ ED NO: 156, SEQ ID 
NO:l 57, SEQ ED NO: 1 58, SEQ ED NO: 1 59, SEQ ID NO: 1 60, SEQ ID NO: 1 6 1 , SEQ ID 
NO: 162, SEQ ED NO:163, SEQ ED NO: 164, SEQ ID NO: 165. SEQ ID NO: 166, SEQ ID 

20 NO: 167, SEQ ED NO: 168, SEQ ED NO: 169, SEQ ID NO: 170, SEQ ID NO:171, SEQ ID 
NO: 172, SEQ ID NO: 173, SEQ ID NO: 174, SEQ ID NO: 175, SEQ ID NO: 176, SEQ ID 
NO: 177, SEQ ID NO: 178, SEQ ED NO: 179, SEQ ED NO: 180, SEQ ID NO:181, SEQ ID 
NO: 182, SEQ ID NO: 183, SEQ ID NO: 184, SEQ ED NO: 185, SEQ ED NO: 186, SEQ ID 
NO: 187, SEQ ID NO: 188, SEQ ID NO: 189, SEQ ID NO: 190, SEQ ID NO: 191, SEQ ID 

25 NO: 1 99, SEQ ED NO: 1 93, SEQ ID NO: 1 94, SEQ ID NO: 1 95, SEQ ID NO: 1 96, SEQ ID 
NO: 197, SEQ ED NO: 198, SEQ ED NO:199, SEQ ED NO:200, SEQ ID NO:201, SEQ ID 
NO:202, SEQ ED NO:203, SEQ ED NO:204, SEQ ED NO:205, SEQ ED NO:206, SEQ ID 
NO:207, SEQ ED NO:208, SEQ ED NO:209, SEQ ED NO:210, SEQ ED NO:21 1, SEQ ID 
NO:212, SEQ ED NO:213, SEQ ED NO:214, SEQ ED NO:215, SEQ ED NO:216, SEQ ED 

30 NO:217, SEQ ED NO:218, SEQ ED NO:219, SEQ ED NO:220, SEQ ED NO:221, SEQ ID 
NO:222, SEQ ED NO:223, SEQ ED NO:224, SEQ ED NO:225, SEQ ID NO:226, SEQ ID 
NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID 
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NO:232, SEQ ID NO:233, SEQ ID NO:234 5 SEQ ID NO:235, SEQ ID NO:236, SEQ ID 
NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241, and SEQ 
ID NO:242, or one or more fragments thereof, with a control nucleic acid target region 
encoding the kinase polypeptide, or one or more fragments thereof; and (b) detecting 
5 differences in sequence or amount between the target region and the control target region, 

as an indication of the disease or disorder. Preferably, the disease or disorder is selected 
from the group consisting of immune-related diseases and disorders, organ transplantation, 
myocardial infarction, cardiovascular disease, stroke, renal failure, oxidative stress-related 
neurodegenerative disorders, and cancer. Immune-related diseases and disorders include, 

1 0 but are not limited to, those discussed previously. 

The term "comparing" as used herein refers to identifying discrepancies between 
the nucleic acid target region isolated from a sample, and the control nucleic acid target 
region. The discrepancies can be in the nucleotide sequences, e.g. insertions, deletions, or 
point mutations, or in the amount of a given nucleotide sequence. Methods to determine 

1 5 these discrepancies in sequences are well-known to one of ordinary skill in the art. The 

"control" nucleic acid target region refers to the sequence or amount of the sequence 
found in normal cells, e.g. cells that are not diseased as discussed previously. 
The term also includes anti-sense molecules drawn thereto. 
The invention has been described broadly and generically herein. Each of the 

20 narrower species and subgeneric groupings falling within the generic disclosure also form 

part of the invention. This includes the generic description of the invention with a proviso 
or negative limitation removing any subject matter from the genus, regardless of whether 
or not the excised material is specifically recited herein. For example, in some instances 
the nucleotide sequence of particular kinase polypeptides may not be part of a preferred 

25 embodiment. 

The summary of the invention described above is not limiting and other features 
and advantages of the invention will be apparent from the following detailed description of 
the invention, and from the claims. 
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BRIEF DESCRIPTION OF THE FIGURES 
Figures 1 A to IBB shows the amino acid sequences of SEQ ID NO: 122, SEQ ID 
NO: 123, SEQ EDNO:124, SEQ ID NO: 125, SEQ ID NO: 126, SEQ ID NO: 127, SEQ ID 
NO: 1 28, SEQ ID NO: 1 29, SEQ ID NO: 1 30, SEQ ID NO: 1 3 1 , SEQ ID NO: 1 32, SEQ ID 
5 NO:133, SEQ ID NO: 134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID NO:137, SEQ ID 
NO: 138, SEQ ID NO: 139, SEQ ID NO: 140, SEQ ID NO: 141, SEQ ID NO: 142, SEQ ID 
NO: 143, SEQ ID NO: 144, SEQ ID NO: 145, SEQ ID NO: 146, SEQ ID NO: 147, SEQ ID 
NO: 148, SEQ ID NO: 1 49, SEQ ID NO: 1 50, SEQ ID NO: 1 5 1 , SEQ ID NO: 1 52, SEQ ID 
NO: 153, SEQ ID NO: 154, SEQ ED NO: 155, SEQ ID NO: 156, SEQ ID NO: 157, SEQ ID 

10 NO: 1 58, SEQ ID NO: 1 59, SEQ ID NO: 1 60, SEQ ID NO: 1 6 1 , SEQ ID NO: 1 62, SEQ ID 
NO: 163, SEQ ID NO: 164, SEQ ID NO:165. SEQ ID NO: 166, SEQ ED NO: 167, SEQ ID 
NO: 1 68, SEQ ID NO: 1 69, SEQ ID NO: 1 70, SEQ ED NO: 1 7 1 , SEQ ID NO: 1 72, SEQ ID 
NO: 173, SEQ ID NO: 174, SEQ ID NO: 175, SEQ ID NO: 176, SEQ ID NO: 177, SEQ ED 
NO: 1 78, SEQ ED NO: 1 79, SEQ ID NO: 1 80, SEQ ED NO:1 8 1 , SEQ ID NO: 1 82, SEQ ID 

15 NO: 1 83, SEQ ED NO: 1 84, SEQ ED NO: 1 85, SEQ ED NO: 1 86, SEQ ID NO: 1 87, SEQ ID 
NO: 1 88, SEQ ED NO: 1 89, SEQ ED NO: 1 90, SEQ ID NO: 1 9 1 , SEQ ED NO: 1 99, SEQ ID 
NO: 193, SEQ ED NO: 194, SEQ ED NO: 195, SEQ ID NO: 196, SEQ ID NO: 197, SEQ ID 
NO: 198, SEQ ED NO: 199, SEQ ID NO.200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID 
NO:203, SEQ ED NO:204, SEQ ID NO:205, SEQ ED NO:206, SEQ ED NO:207, SEQ ID 

20 NO:208, SEQ ED NO:209, SEQ ID NO:210, SEQ ID NO:21 1, SEQ ID NO:212, SEQ ID 

NO:213, SEQ IDNO:214, SEQ ED NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID 
NO:218, SEQ ID NO:219, SEQ ED NO:220, SEQ ED NO:221, SEQ ED NO:222, SEQ ID 
NO:223, SEQ ED NO:224, SEQ ED NO:225, SEQ ED NO:226, SEQ ED NO:227, SEQ ID 
NO:228, SEQ ID NO:229, SEQ ED NO:230, SEQ ED NO:231, SEQ ED NO:232, SEQ ID 

25 NO:233, SEQ ED NO:234, SEQ ED NO:235, SEQ ID NO:236, SEQ ED NO:237, SEQ ID 

NO:238, SEQ ED NO:239, SEQ ED NO:240, SEQ ED NO:241, and SEQ ED NO:242. 

Figures 2A to 2MMMM shows the nucleic acid sequences of SEQ ID NO:l, SEQ 
ED NO:2, SEQ ED NO:3, SEQ ED NO:4, SEQ ED NO:5, SEQ ED NO:6, SEQ ED NO:7, 
SEQ ED NO:8, SEQ ID NO:9, SEQ ID NO: 1 0, SEQ ID NO: 1 1 , SEQ ED NO: 1 2, SEQ ID 

30 NO: 1 3, SEQ ID NO: 1 4, SEQ ID NO: 1 5, SEQ ID NO: 1 6, SEQ ID NO: 1 7, SEQ ID NO: 1 8, 
SEQ ED NO: 1 9, SEQ ED NO:20, SEQ ID NO:21 , SEQ ID NO:22, SEQ ID NO:23, SEQ 
ID NO:24, SEQ ED NO:25, SEQ ED NO:26, SEQ ED NO:27, SEQ ID NO:28, SEQ ID 
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NO:29, SEQ ID NO:30, SEQ ID NO:31, SEQ ID NO:32, SEQ ID NO:33, SEQ ID NO:34, 
SEQ ID NO:35, SEQ ID NO:36, SEQ ID NO:37, SEQ ID NO:38, SEQ ID NO:39, SEQ 
ID NO:40, SEQ ID N0:41, SEQ ID NO:42, SEQ ID NO:43, SEQ ID NO:44, SEQ ID 
NO:45, SEQ ID NO:46, SEQ ID NO:47, SEQ ID NO:48, SEQ ID NO:49, SEQ ID NO:50, 
SEQ ED N0:51 , SEQ ID NO:52, SEQ ED NO:53, SEQ ID NO:54, SEQ ID NO:55, SEQ 
ID NO:56, SEQ ED NO:57, SEQ ED NO:58, SEQ ID NO:59, SEQ ID NO:60, SEQ ID 
N0:61, SEQ ID NO:62, SEQ ID NO:63, SEQ ID NO:64, SEQ ID NO:65, SEQ ID NO:66, 
SEQ ED NO:67, SEQ ID NO:68, SEQ ED NO:69, SEQ ED NO:70, SEQ ID N0:71, SEQ 
ED NO:72, SEQ ID NO:73, SEQ ID NO:74, SEQ ID NO:75, SEQ ID NO:76, SEQ ID 
NO:77, SEQ ID NO:78, SEQ ID NO:79, SEQ ID NO:80, SEQ ID N0:81, SEQ ID NO:82, 
SEQ ED NO:83, SEQ ID NO:84, SEQ ID NO:85, SEQ ID NO:86, SEQ ID NO:87, SEQ 
ID NO:88, SEQ ED NO:89, SEQ ID NO:90, SEQ ID NO:91, SEQ ID NO:92, SEQ ID 
NO:93, SEQ ED NO:94, SEQ ED NO:95, SEQ ID NO:96, SEQ ID NO:97, SEQ ID NO:98, 
SEQ ID NO:99, SEQ ID NO:100, SEQ ID NO:101, SEQ ID NO:102, SEQ ID NO:103, 
SEQ ED NO: 1 04, SEQ ED NO: 1 05, SEQ ED NO: 1 06, SEQ ID NO: 1 07, SEQ ED NO: 1 08, 
SEQ ID NO: 1 09, SEQ ED NO: 1 1 0, SEQ ED NO: 1 1 1 , SEQ ED NO: 1 1 2, SEQ ED NO: 1 1 3, 
SEQ ID NO: 1 1 4, SEQ ED NO: 1 1 5, SEQ ED NO: 1 1 6, SEQ ED NO: 1 1 7, SEQ ED NO: 1 1 8, 
SEQ ED NO:l 19, SEQ ED NO:120, and SEQ ED NO: 121. 

DETAILED DESCRIPTION OF THE INVENTION 
The present invention relates in part to kinase polypeptides, nucleic acids encoding 
such polypeptides, cells containing such nucleic acids, antibodies to such polypeptides, 
assays utilizing such polypeptides, and methods relating to all of the foregoing. The 
present invention is based upon the isolation and characterization of new kinase 
polypeptides. The polypeptides and nucleic acids may be produced using well-known and 
standard synthesis techniques when given the sequences presented herein. 

I. The Nucleic Acids of the Invention 

Included within the scope of this invention are the functional equivalents of the 
herein-described isolated nucleic acid molecules. The degeneracy of the genetic code 
permits substitution of certain codons by other codons that specify the same amino acid 
and hence would give rise to the same protein. The nucleic acid sequence can vary 
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substantially since, with the exception of methionine and tryptophan, the known amino 
acids can be coded for by more than one codon. Thus, portions or all of the kinase genes 
of the invention could be synthesized to give a nucleic acid sequence significantly 
different from one selected from the group consisting of those set forth in SEQ ID NO:l , 

5 SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID 
NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:11 , SEQ ID NO:12, 
SEQ ID NO:13, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ 
ID NO: 18, SEQ ID NO: 19, SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID 
NO:23, SEQ ID NO:24, SEQ ID NO:25, SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:28, 

10 SEQ ID NO:29, SEQ ID NO:30, SEQ ID NO:31, SEQ ID NO:32, SEQ ID NO:33, SEQ 
ID NO:34, SEQ ID NO:35, SEQ ID NO:36, SEQ ED NO:37, SEQ ED NO:38, SEQ ED 
NO:39, SEQ ID NO:40, SEQ ID NO:41, SEQ ID NO:42, SEQ ED NO:43, SEQ ID NO:44, 
SEQ ED NO:45, SEQ ED NO:46, SEQ ID NO:47, SEQ ID NO:48, SEQ ID NO:49, SEQ 
ID NO:50, SEQ ID NO:51, SEQ ID NO:52, SEQ ID NO:53, SEQ ID NO:54, SEQ ID 

15 NO:55, SEQ ID NO:56, SEQ ED NO:57, SEQ ID NO:58, SEQ ED NO:59, SEQ ID NO:60, 

SEQ ED NO:61, SEQ ID NO:62, SEQ ED NO:63, SEQ ED NO:64, SEQ ID NO:65, SEQ 
ID NO:66, SEQ ED NO:67, SEQ ID NO:68, SEQ ID NO:69, SEQ ID NO:70, SEQ ID 
NO:71 , SEQ ED NO:72, SEQ ED NO:73, SEQ ID NO:74, SEQ ID NO:75, SEQ ID NO:76, 
SEQ ED NO:77, SEQ ED NO:78, SEQ ED NO:79, SEQ ED NO:80, SEQ ID NO:81, SEQ 

20 ID NO:82, SEQ ED NO:83, SEQ ED NO:84, SEQ ID NO:85, SEQ ID NO:86, SEQ ID 

NO:87, SEQ ID NO:88, SEQ ED NO:89, SEQ ED NO:90, SEQ ED NO:91 , SEQ ID NO:92, 
SEQ ED NO:93, SEQ ED NO:94, SEQ ID NO:95, SEQ ED NO:96, SEQ ED NO:97, SEQ 
ID NO:98, SEQ ED NO:99, SEQ ED NO: 100, SEQ ID NO: 101, SEQ ED NO: 102, SEQ ID 
NO: 1 03, SEQ ED NO: 1 04, SEQ ED NO: 1 05, SEQ ID NO: 1 06, SEQ ED NO: 1 07, SEQ ID 

25 NO: 1 08, SEQ ID NO: 1 09, SEQ ED NO: 1 1 0, SEQ ID NO: 1 1 1 , SEQ ID NO: 1 1 2, SEQ ED 

NO: 113, SEQ ED NO:l 14, SEQ ED NO:l 15, SEQ ID NO:l 16, SEQ ID NO: 1 17, SEQ ID 
NO: 11 8, SEQ ID NO: 11 9, SEQ ID NO: 120, and SEQ ID NO: 121. The encoded ammo 
acid sequence thereof would, however, be preserved. 

In addition, the nucleic acid sequence may comprise a nucleotide sequence which 

30 results from the addition, deletion or substitution of at least one nucleotide to the 5 '-end 
and/or the 3'-end of the nucleic acid sequence shown in SEQ ID NO:l, SEQ ID NO:2, 
SEQ ED NO:3, SEQ ED NO:4, SEQ ID NO:5, SEQ ED NO:6, SEQ ID NO:7, SEQ ID 



WO 00/73469 



76 



PCT7US00/14842 



NO:8, SEQ ID NO:9, SEQ ID NO: 1 0, SEQ ID NO: 1 1 , SEQ ID NO: 1 2, SEQ ID NO: 1 3, 
SEQ ED NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ED NO:17, SEQ ID NO:18, SEQ 
ID NO: 19, SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID 
NO:24, SEQ ID NO:25, SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:28, SEQ ID NO:29, 
SEQ ID NO:30, SEQ ID NO:31, SEQ ID NO:32, SEQ ID NO:33, SEQ ID NO:34, SEQ 
ID NO:35, SEQ ID NO:36, SEQ ID NO:37, SEQ ID NO:38, SEQ ID NO:39, SEQ ID 
NO:40, SEQ ID NO:41, SEQ ID NO:42, SEQ ID NO:43, SEQ ID NO:44, SEQ ID NO:45, 
SEQ ID NO:46, SEQ ID NO:47, SEQ ID NO:48, SEQ ID NO:49, SEQ ID NO:50, SEQ 
ID NO:51, SEQ ID NO:52, SEQ ID NO:53, SEQ ED NO:54, SEQ ID NO:55, SEQ ID 
NO:56, SEQ ED NO:57, SEQ ED NO:58, SEQ ED NO:59, SEQ ID NO:60, SEQ ID NO:61 , 
SEQ ID NO:62, SEQ ID NO:63, SEQ ID NO:64, SEQ ID NO:65, SEQ ID NO:66, SEQ 
ED NO:67, SEQ ID NO:68, SEQ ED NO:69, SEQ ED NO:70, SEQ ID NO:71, SEQ ED 
NO:72, SEQ ED NO:73, SEQ ID NO:74, SEQ ID NO:75, SEQ ID NO:76, SEQ ID NO:77, 
SEQ ID NO:78, SEQ ID NO:79, SEQ ID NO:80, SEQ ID NO:81, SEQ ED NO:82, SEQ 
ID NO:83, SEQ ID NO:84, SEQ ID NO:85, SEQ ID NO:86, SEQ ED NO:87, SEQ ID 
NO:88, SEQ ID NO:89, SEQ ED NO:90, SEQ ED NO:91 , SEQ ED NO:92, SEQ ID NO:93, 
SEQ ID NO:94, SEQ ID NO:95, SEQ ID NO:96, SEQ ID NO:97, SEQ ID NO:98, SEQ 
ID NO:99, SEQ ID NO: 1 00, SEQ ID NO: 1 0 1 , SEQ ID NO: 1 02, SEQ ID NO: 1 03 , SEQ ID 
NO: 104, SEQ ID NO: 105, SEQ ID NO: 106, SEQ ID NO: 107, SEQ ID NO: 108, SEQ ID 
NO: 1 09, SEQ ED NO: 1 1 0, SEQ ED NO: 1 1 1 , SEQ ID NO: 1 1 2, SEQ ID NO: 1 1 3, SEQ ID 
NO:l 14, SEQ ID NO:l 15, SEQ ID NO:l 16, SEQ ID NO:l 17, SEQ ID NO.l 18, SEQ ID 
NO: 11 9, SEQ ED NO: 120, and SEQ ID NO: 121, or a derivative thereof. Any nucleotide 
or polynucleotide may be used in this regard, provided that its addition, deletion or 
substitution does not alter the amino acid sequence of SEQ ID NO: 122, SEQ ED NO: 123, 
SEQ ED NO: 1 24, SEQ ID NO: 1 25, SEQ ED NO: 1 26, SEQ ED NO: 1 27, SEQ ID NO: 1 28, 
SEQ ED NO: 129, SEQ ED NO: 130, SEQ DDNO:131, SEQ DD NO: 132, SEQ ID NO: 133, 
SEQ ED NO: 134, SEQ ED NO: 135, SEQ DD NO: 136, SEQ ED NO: 137, SEQ ED NO: 138, 
SEQ ED NO:139, SEQ DD NO:140, SEQ DD NO:141, SEQ DD NO:142, SEQ DD NO:143, 
SEQ DD NO:144, SEQ DD NO:145, SEQ DD NO:146, SEQ DD NO:147, SEQ DD NO:148, 
SEQ DD NO: 149, SEQ DD NO: 150, SEQ DD NO: 151, SEQ ED NO: 152, SEQ ED NO: 153, 
SEQ ED NO: 1 54, SEQ DD NO: 1 55, SEQ DD NO: 1 56, SEQ ID NO: 157, SEQ ID NO: 1 58, 
SEQ ID NO: 1 59, SEQ DD NO: 1 60, SEQ DD NO: 1 61 , SEQ DD NO: 1 62, SEQ ID NO:l 63, 
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SEQ ID NO: 164, SEQ ID NO: 165. SEQ ID NO: 166, SEQ ID NO: 167, SEQ ID NO: 168, 
SEQ ID NO: 169, SEQ ID NO: 170, SEQ ID NO: 171, SEQ ID NO: 172, SEQ ID NO: 173, 
SEQ ID NO: 1 74, SEQ ID NO: 1 75, SEQ ID NO:l 76, SEQ ID NO: 1 77, SEQ ID NO: 178, 
SEQ ID NO: 179, SEQ ID NO: 180, SEQ ID NO: 181, SEQ ID NO: 182, SEQ ID NO: 183, 

5 SEQ ED NO: 1 84, SEQ ID NO: 1 85, SEQ ID NO: 1 86, SEQ ID NO: 1 87, SEQ ID NO: 1 88, 
SEQ ID NO: 189, SEQ ID NO: 190, SEQ ID NO: 191, SEQ ID NO: 199, SEQ ID NO: 193, 
SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196, SEQ ID NO:197, SEQ ID NO:198, 
SEQ ID NO: 199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203, 
SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, 

10 SEQ ID NO:209, SEQ ED NO:210, SEQ ID NO:21 1, SEQ ID NO:212, SEQ ID NO:213, 

SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID NO:21 8, 
SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223, 
SEQ ID NO:224, SEQ ED NO:225, SEQ ID NO:226, SEQ ED NO:227, SEQ ID NO:228, 
SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231 , SEQ ID NO:232, SEQ ID NO:233, 

15 SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, 

SEQ ID NO:239, SEQ ID NO:240, SEQ ED NO:241, and SEQ ID NO:242, that is encoded 
by the nucleotide sequence. For example, the present invention is intended to include any 
nucleic acid sequence resulting from the addition of ATG as an initiation codon at the 5'- 
end of the inventive nucleic acid sequence or its derivative, or from the addition of TTA, 

20 TAG or TGA as a termination codon at the 3'-end of the inventive nucleotide sequence or 
its derivative. Moreover, the nucleic acid molecule of the present invention may, as 
necessary, have restriction endonuclease recognition sites added to its 5' -end and/or 3'- 
end. 

Such functional alterations of a given nucleic acid sequence afford an opportunity 
25 to promote secretion and/or processing of heterologous proteins encoded by foreign 

nucleic acid sequences fused thereto, for example. All variations of the nucleotide 
sequence of the kinase genes of the invention and fragments thereof permitted by the 
genetic code are, therefore, included in this invention. 

Further, it is possible to delete codons or to substitute one or more codons with 
30 codons other than degenerate codons to produce a structurally modified polypeptide, but 
one which has substantially the same utility or activity as the polypeptide produced by the 
unmodified nucleic acid molecule. As recognized in the art, the two polypeptides are 
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functionally equivalent, as are the two nucleic acid molecules that give rise to their 
production, even though the differences between the nucleic acid molecules are not related 
to the degeneracy of the genetic code. This is discussed further in the "Functional 
Derivatives" section, herein. 
5 Finally, many of the nucleic acid molecules of the invention are provided as a 

partial sequence only (Fig. 2 A through 2QQ). However, it is standard for one of ordinary 
skill in the art to obtain a full-length sequence when provided with a partial sequence. 
Similarly, when provided with a partial or full-length sequence it is standard for one of 
ordinary skill in the art to obtain nucleic acid sequence coding for homologous proteins. 

10 Therefore, these nucleic acid molecules are also part of the invention. 

The characteristics of the protein kinase nucleic acid sequences of the invention are 
provided in Table 1. The protein kinases fall into 10 known groups: AGC, CAMK, CKI, 
CMGC, dsPK, EIFK, LIMK, MLK, STE and TK. In addition, there are a significant 
number of protein kinases that do not belong to any of the known groups, and therefore 

1 5 presumably define new protein kinase groups. 

Additional characteristics may be found, inter alia, in the tables, namely Table 1 , 
Table 2, Table 3 and Table 4, shown below. 

II. Nucleic Acid Probes, Methods, and Kits for Detection of Protein Kinases. 

20 A nucleic acid probe of the present invention may be used to probe an appropriate 

chromosomal or cDNA library by usual hybridization methods to obtain other nucleic acid 
molecules of the present invention. A chromosomal DNA or cDNA library may be 
prepared from appropriate cells according to recognized methods in the art (cf. "Molecular 
Cloning: A Laboratory Manual", second edition, Cold Spring Harbor Laboratory, 

25 Sambrook, Fritsch, & Maniatis, eds., 1989). 

In the alternative, chemical synthesis can be carried out in order to obtain nucleic 
acid probes having nucleotide sequences that correspond to N-terminal, kinase or C- 
terminal portions, for example, of the amino acid sequence of the polypeptide of interest. 
The synthesized nucleic acid probes may be used as primers in a polymerase chain 

30 reaction (PCR) carried out in accordance with recognized PCR techniques, essentially 
according to PCR Protocols, "A Guide to Methods and Applications", Academic Press, 
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Michael, et a/., eds., 1990, utilizing the appropriate chromosomal or cDNA library to 
obtain the fragment of the present invention. 

One skilled in the art can readily design such probes based on the sequence 
disclosed herein using methods of computer alignment and sequence analysis known in 
5 the art ("Molecular Cloning: A Laboratory Manual", 1 989, supra). The hybridization 

probes of the present invention can be labeled by standard labeling techniques such as 
with a radiolabel, enzyme label, fluorescent label, biotin-avidin label, chemiluminescence, 
and the like. After hybridization, the probes may be visualized using known methods. 
The nucleic acid probes of the present invention include RNA, as well as DNA 

10 probes, such probes being generated using techniques known in the art. The nucleic acid 

probe may be immobilized on a solid support. Examples of such solid supports include, 
but are not limited to, plastics such as polycarbonate, complex carbohydrates such as 
agarose and sepharose, and acrylic resins, such as polyacrylamide and latex beads. 
Techniques for coupling nucleic acid probes to such solid supports are well known in the 

15 art. 

The test samples suitable for nucleic acid probing methods of the present invention 
include, for example, cells or nucleic acid extracts of cells, or biological fluids. The 
samples used in the above-described methods will vary based on the assay format, the 
detection method and the nature of the tissues, cells or extracts to be assayed. Methods for 

20 preparing nucleic acid extracts of cells are well known in the art and can be readily 

adapted in order to obtain a sample that is compatible with the method utilized. 

One method of detecting the presence of nucleic acids of the invention in a sample 
comprises (a) contacting said sample with the above-described nucleic acid probe under 
conditions such that hybridization occurs, and (b) detecting the presence of said probe 

25 bound to said nucleic acid molecule. One skilled in the art would select the nucleic acid 

probe according to techniques known in the art as described above. Samples to be tested 
include but should not be limited to RNA samples of human tissue. 

A kit for detecting the presence of nucleic acids of the invention in a sample 
comprises at least one container means having disposed therein the above-described 

30 nucleic acid probe. The kit may further comprise other containers comprising one or more 

of the following: wash reagents and reagents capable of detecting the presence of bound 
nucleic acid probe. Examples of detection reagents include, but are not limited to 
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radiolabeled probes, enzymatic labeled probes (horseradish peroxidase, alkaline 
phosphatase), and affinity labeled probes (biotin, avidin, or steptavidin). 

In detail, a compartmentalized kit includes any kit in which reagents are contained 
in separate containers. Such containers include small glass containers, plastic containers 

5 or strips of plastic or paper. Such containers allow the efficient transfer of reagents from 

one compartment to another compartment such that the samples and reagents are not 
cross-contaminated and the agents or solutions of each container can be added in a 
quantitative fashion from one compartment to another. Such containers will include a 
container which will accept the test sample, a container which contains the probe or 

10 primers used in the assay, containers which contain wash reagents (such as phosphate 

buffered saline, Tris-buffers, and the like), and containers which contain the reagents used 
to detect the hybridized probe, bound antibody, amplified product, or the like. One skilled 
in the art will readily recognize that the nucleic acid probes described in the present 
invention can readily be incorporated into one of the established kit formats that are well 

1 5 known in the art. 

m. DNA Constructs Comprising a Protein Kinase Nucleic Acid Molecule and Cells 
Containing These Constructs. 

The present invention also relates to a recombinant DNA molecule comprising, 5' 
20 to 3', a promoter effective to initiate transcription in a host cell and the above-described 
nucleic acid molecules. In addition, the present invention relates to a recombinant DNA 
molecule comprising a vector and an above-described nucleic acid molecule. The present 
invention also relates to a nucleic acid molecule comprising a transcriptional region 
functional in a cell, a sequence complementary to an RNA sequence encoding an amino 
25 acid sequence corresponding to the above-described polypeptide, and a transcriptional 

termination region functional in said cell. The above-described molecules may be isolated 
and/or purified DNA molecules. 

The present invention also relates to a cell or organism that contains an above- 
described nucleic acid molecule and thereby is capable of expressing a polypeptide. The 
30 polypeptide may be purified from cells that have been altered to express the polypeptide. 
A cell is said to be "altered to express a desired polypeptide" when the cell, through 
genetic manipulation, is made to produce a protein which it normally does not produce or 
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which the cell normally produces at lower levels. One skilled in the art can readily adapt 
procedures for introducing and expressing either genomic, cDNA, or synthetic sequences 
into either eukaryotic or prokaryotic cells. 

A nucleic acid molecule, such as DNA, is said to be "capable of expressing" a 
5 polypeptide if it contains nucleotide sequences which contain transcriptional and 

translational regulatory information and such sequences are "operably linked" to 
nucleotide sequences which encode the polypeptide. An operable linkage is a linkage in 
which the regulatory DNA sequences and the DNA sequence sought to be expressed are 
connected in such a way as to permit gene sequence expression. The precise nature of the 

10 regulatory regions needed for gene sequence expression may vary from organism to 

organism, but shall in general include a promoter region which, in prokaryotes, contains 
both the promoter (which directs the initiation of RNA transcription) as well as the DNA 
sequences which, when transcribed into RNA, will signal synthesis initiation. Such 
regions will normally include those 5 '-non-coding sequences involved with initiation of 

15 transcription and translation, such as the TATA box, capping sequence, CAAT sequence, 

and the like. 

If desired, the non-coding region 3 5 to the sequence encoding a kinase of the 
invention may be obtained by the above-described methods. This region may be retained 
for its transcriptional termination regulatory sequences, such as termination and 

20 polyadenylation. Thus, by retaining the 3'-region naturally contiguous to the DNA 

sequence encoding a kinase of the invention, the transcriptional termination signals may 
be provided. Where the transcriptional termination signals are not satisfactorily functional 
in the expression host cell, then a 3' region functional in the host cell may be substituted. 
Two DNA sequences (such as a promoter region sequence and a sequence 

25 encoding a kinase of the invention) are said to be operably linked if the nature of the 

linkage between the two DNA sequences does not (1) result in the introduction of a frame- 
shift mutation, (2) interfere with the ability of the promoter region sequence to direct the 
transcription of a gene sequence encoding a kinase of the invention, or (3) interfere with 
the ability of the gene sequence of a kinase of the invention to be transcribed by the 

30 promoter region sequence. Thus, a promoter region would be operably linked to a DNA 

sequence if the promoter were capable of effecting transcription of that DNA sequence. 
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Thus, to express a gene encoding a kinase of the invention, transcriptional and 
translational signals recognized by an appropriate host are necessary. 

The present invention encompasses the expression of a gene encoding a kinase of 
the invention (or a functional derivative thereof) in either prokaryotic or eukaryotic cells. 
5 Prokaryotic hosts are, generally, very efficient and convenient for the production of 

recombinant proteins and are, therefore, one type of preferred expression system for 
kinases of the invention. Prokaryotes most frequently are represented by various strains of 
E. coli. However, other microbial strains may also be used, including other bacterial 
strains. 

10 In prokaryotic systems, plasmid vectors that contain replication sites and control 

sequences derived from a species compatible with the host may be used. Examples of 
suitable plasmid vectors may include pBR322, pUCl 18, pUC119 and the like; suitable 
phage or bacteriophage vectors may include ygtlO, ygtl 1 and the like; and suitable virus 
vectors may include pMAM-neo, pKRC and the like. Preferably, the selected vector of the 

15 present invention has the capacity to replicate in the selected host cell. 

Recognized prokaryotic hosts include bacteria such as E. coli, Bacillus, 
Sireptomyces, Pseudomonas, Salmonella, Serratia, and the like. However, under such 
conditions, the polypeptide will not be glycosylated. The prokaryotic host must be 
compatible with the replicon and control sequences in the expression plasmid. 

20 To express a kinase of the invention (or a functional derivative thereof) in a 

prokaryotic cell, it is necessary to operably link the sequence encoding the kinase of the 
invention to a functional prokaryotic promoter. Such promoters may be either constitutive 
or, more preferably, regulatable {i.e., inducible or derepressible). Examples of constitutive 
promoters include the int promoter of bacteriophage X, the bla promoter of the P- 

25 lactamase gene sequence of pBR322, and the cat promoter of the chloramphenicol acetyl 

transferase gene sequence of pPR325, and the like. Examples of inducible prokaryotic 
promoters include the major right and left promoters of bacteriophage X (P L and P R ), the 
trp, recA, AacZ, AacI, and gal promoters of E. coli, the a-amylase (Ulmanen et al, J. 
Bacteriol. 162:176-182, 1985) and the c-28-specific promoters of B. subtilis (Gilman et 

30 al, Gene Sequence 32:1 1-20, 1984), the promoters of the bacteriophages of Bacillus 

(Gryczan, In: The Molecular Biology of the Bacilli, Academic Press, Inc., NY, 1 982), and 
Sireptomyces promoters (Ward et al % Mol. Gen. Genet. 203:468-478, 1986). Prokaryotic 
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promoters are reviewed by Glick (Ind. Microbiot. 1 :277-282, 1987), Cenatiempo 
(Biochimie 68:505-516, 1986), and Gottesman (Ann. Rev. Genet. 18:415-442, 1984). 

Proper expression in a prokaryotic cell also requires the presence of a ribosome- 
binding site upstream of the gene sequence-encoding sequence. Such ribosome-binding 
5 sites are disclosed, for example, by Gold et al (Ann. Rev. Microbiol. 35:365-404, 1981). 

The selection of control sequences, expression vectors, transformation methods, and the 
like, are dependent on the type of host cell used to express the gene. As used herein, 
"cell", "cell line", and "cell culture" may be used interchangeably and all such 
designations include progeny. Thus, the words "transformants" or "transformed cells" 

10 include the primary subject cell and cultures derived therefrom, without regard to the 

number of transfers. It is also understood that all progeny may not be precisely identical 
in DNA content, due to deliberate or inadvertent mutations. However, as defined, mutant 
progeny have the same functionality as that of the originally transformed cell. 

Host cells which may be used in the expression systems of the present invention 

15 are not strictly limited, provided that they are suitable for use in the expression of the 

kinase polypeptide of interest. Suitable hosts may often include eukaryotic cells. 
Preferred eukaryotic hosts include, for example, yeast, fungi, insect cells, mammalian cells 
either in vivo, or in tissue culture. Mammalian cells which may be useful as hosts include 
HeLa cells, cells of fibroblast origin such as VERO or CHO-K1 , or cells of lymphoid 

20 origin and their derivatives. Preferred mammalian host cells include SP2/0 and J558L, as 

well as neuroblastoma cell lines such as EV1R 332, which may provide better capacities for 
correct post-translational processing. 

In addition, plant cells are also available as hosts, and control sequences 
compatible with plant cells are available, such as the cauliflower mosaic virus 35S and 

25 19S, and nopaline synthase promoter and polyadenylation signal sequences. Another 

preferred host is an insect cell, for example the Drosophila larvae. Using insect cells as 
hosts, the Drosophila alcohol dehydrogenase promoter can be used (Rubin, Science 
240:1453-1459, 1988). Alternatively, baculovirus vectors can be engineered to express 
large amounts of kinases of the invention in insect cells (Jasny, Science 238:1653, 1987; 

30 Miller et al, In: Genetic Engineering, Vol. 8, Plenum, Setlow et al, eds., pp. 277-297, 

1986). 
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Any of a series of yeast expression systems can be utilized which incorporate 
promoter and termination elements from the actively expressed sequences coding for 
glycolytic enzymes that are produced in large quantities when yeast are grown in mediums 
rich in glucose. Known glycolytic gene sequences can also provide very efficient 
transcriptional control signals. Yeast provides substantial advantages in that it can also 
carry out post-translational modifications. A number of recombinant DNA strategies exist 
utilizing strong promoter sequences and high copy number plasmids which can be utilized 
for production of the desired proteins in yeast. Yeast recognizes leader sequences on 
cloned mammalian genes and secretes peptides bearing leader sequences (i.e., pre- 
peptides). Several possible vector systems are available for the expression of kinases of 
the invention in a mammalian host. 

A wide variety of transcriptional and translation^ regulatory sequences may be 
employed, depending upon the nature of the host. The transcriptional and translational 
regulatory signals may be derived from viral sources, such as adenovirus, bovine 
papilloma virus, cytomegalovirus, simian virus, or the like, where the regulatory signals 
are associated with a particular gene sequence which has a high level of expression. 
Alternatively, promoters from mammalian expression products, such as actin, collagen, 
myosin, and the like, may be employed. Transcriptional initiation regulatory signals may 
be selected which allow for repression or activation, so that expression of the gene 
sequences can be modulated. Of interest are regulatory signals which are temperature- 
sensitive so that by varying the temperature, expression can be repressed or initiated, or 
are subject to chemical (such as metabolite) regulation. 

Expression of kinases of the invention in eukaryotic hosts requires the use of 
eukaryotic regulatory regions. Such regions will, in general, include a promoter region 
sufficient to direct the initiation of RNA synthesis. Preferred eukaryotic promoters 
include, for example, the promoter of the mouse metallothionein 1 gene sequence (Hamer 
et al, J. MoL Appl. Gen. 1 :273-288, 1982); the TK promoter of Herpes virus (McKnight, 
Cell 31:355-365, 1982); the SV40 early promoter (Benoist et ai, Nature (London) 
290:304-31, 1981); and the yeast gal4 gene sequence promoter (Johnston et ai, Proc. Natl. 
Acad. Sci. (USA) 79:6971-6975, 1982; Silver etai, Proc. Natl. Acad. Sci. (USA) 
81:5951-5955, 1984). 
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Translation of eukaryotic mRNA is initiated at the codon that encodes the first 
methionine. For this reason, it is preferable to ensure that the linkage between a 
eukaryotic promoter and a DNA sequence which encodes a kinase of the invention (or a 
functional derivative thereof) does not contain any intervening codons which are capable 
of encoding a methionine (i.e., AUG). The presence of such codons results either in the 
formation of a fusion protein (if the AUG codon is in the same reading frame as the kinase 
of the invention coding sequence) or a frame-shift mutation (if the AUG codon is not in 
the same reading frame as the kinase of the invention coding sequence). 

A nucleic acid molecule encoding a kinase of the invention and an operably linked 
promoter may be introduced into a recipient prokaryotic or eukaryotic cell either as a 
nonreplicating DNA or RNA molecule, which may either be a linear molecule or, more 
preferably, a closed covalent circular molecule. Since such molecules are incapable of 
autonomous replication, the expression of the gene may occur through the transient 
expression of the introduced sequence. Alternatively, permanent expression may occur 
through the integration of the introduced DNA sequence into the host chromosome. 

A vector may be employed which is capable of integrating the desired gene 
sequences into the host cell chromosome. Cells which have stably integrated the 
introduced DNA into their chromosomes can be selected by also introducing one or more 
markers which allow for selection of host cells which contain the expression vector. The 
marker may provide for prototrophy to an auxotrophic host, biocide resistance, e.g., 
antibiotics, or heavy metals, such as copper, or the like. The selectable marker gene 
sequence can either be directly linked to the DNA gene sequences to be expressed, or 
introduced into the same cell by co-transfection. Additional elements may also be needed 
for optimal synthesis of mRNA. These elements may include splice signals, as well as 
transcription promoters, enhancers, and termination signals. cDNA expression vectors 
incorporating such elements include those described by Okayama (Mol. Cell. Biol. 3:280-, 
1983). 

The introduced nucleic acid molecule can be incorporated into a plasmid or viral 
vector capable of autonomous replication in the recipient host. Any of a wide variety of 
vectors may be employed for this purpose. Factors of importance in selecting a particular 
plasmid or viral vector include: the ease with which recipient cells that contain the vector 
may be recognized and selected from those recipient cells which do not contain the vector; 
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the number of copies of the vector which are desired in a particular host; and whether it is 
desirable to be able to "shuttle" the vector between host cells of different species. 

Preferred prokaryotic vectors include plasmids such as those capable of replication 
in E. coli (such as, for example, pBR322, ColEl, pSClOl, pACYC 184, ttVX; "Molecular 
5 Cloning: A Laboratory Manual", 1989, supra). Bacillus plasmids include pCl 94, pC221, 

pT127, and the like (Gryczan, In: The Molecular Biology of the Bacilli, Academic Press, 
NY, pp. 307-329, 1982). Suitable Streptomyces plasmids include pi J 101 (Kendall et ai, 
J. Bacteriol. 169:4177-4183, 1987), and streptomyces bacteriophages such as 4>C31 
...(Chater et al. 9 In: Sixth In]erna.tio_nal_Symposium on Actinomycetales Biology, Akademiai 
1 0 Kaido, Budapest, Hungary, pp. 45-54, 1 986). Pseudomonas plasmids are reviewed by 

John et al (Rev. Infect. Dis. 8:693-704, 1986), and Izaki (Jpn. J. Bacteriol. 33:729-742, 
1978). 

Preferred eukaryotic plasmids include, for example, BPV, vaccinia, S V40, 2- 
micron circle, and the like, or their derivatives. Such plasmids are well known in the art 

15 (Botstein et aL, Miami Wntr. Symp. 19:265-274, 1982; Broach, In: "The Molecular 

Biology of the Yeast Saccharomyces: Life Cycle and Inheritance", Cold Spring Harbor 
Laboratory, Cold Spring Harbor, NY, p. 445-470, 1981; Broach, Cell 28:203-204, 1982; 
Bollon et aL, J. Clin. Hematol. Oncol. 10:39-48, 1980; Maniatis, In: Cell Biology: A 
Comprehensive Treatise, Vol. 3, Gene Sequence Expression, Academic Press, NY, pp. 

20 563-608, 1980). 

Once the vector or nucleic acid molecule containing the construct(s) has been 
prepared for expression, the DNA construct(s) may be introduced into an appropriate host 
cell by any of a variety of suitable means, i.e., transformation, transfection, conjugation, 
protoplast fusion, electroporation, particle gun technology, calcium phosphate- 

25 precipitation, direct microinjection, and the like. After the introduction of the vector, 

recipient cells are grown in a selective medium, which selects for the growth of vector- 
containing cells. Expression of the cloned gene(s) results in the production of a kinase of 
the invention, or fragments thereof. This can take place in the transformed cells as such, 
or following the induction of these cells to differentiate (for example, by administration of 

30 bromodeoxyuracil to neuroblastoma cells or the like). A variety of incubation conditions 
can be used to form the peptide of the present invention. The most preferred conditions 
are those which mimic physiological conditions. 
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IV. The Proteins of the Invention 

A variety of methodologies known in the art can be utilized to obtain the 
polypeptides of the present invention. The polypeptides may be purified from tissues or 
cells that naturally produce the polypeptides. Alternatively, the above-described isolated 
nucleic acid fragments could be used to express the kinases of the invention in any 
organism. The samples of the present invention include cells, protein extracts or 
membrane extracts of cells, or biological fluids. The samples will vary based on the assay 
format, the detection method, and the nature of the tissues, cells or extracts used as the 
sample. 

Any eukaryotic organism can be used as a source for the polypeptides of the 
invention, as long as the source organism naturally contains such polypeptides. As used 
herein, "source organism" refers to the original organism from which the amino acid 
sequence of the subunit is derived, regardless of the organism the subunit is expressed in 
and ultimately isolated from. 

One skilled in the art can readily follow known methods for isolating proteins in 
order to obtain the polypeptides free of natural contaminants. These include, but are not 
limited to: size-exclusion chromatography, HPLC, ion-exchange chromatography, and 
immuno-affinity chromatography. 

Further, the polypeptides of the invention include the full-length polypeptides that 
can be identified from the full-length or partial sequences encoded by SEQ ID NO: 122, 
SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO.T26, SEQ ID NO:127, 
SEQ ED NO: 1 28, SEQ ID NO: 1 29, SEQ ID NO: 1 30, SEQ ID NO: 1 3 1 , SEQ ID NO: 1 32, 
SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID NO:137, 
SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141, SEQ ID NO:142, 
SEQ ID NO:143, SEQ ID NO:144, SEQ ED NO:145, SEQ ID NO:146, SEQ ID NO:147, 
SEQ ED NO: 148, SEQ ID NO: 149, SEQ ID NO: 150, SEQ ED NO: 151, SEQ ID NO: 152, 
SEQ ID NO: 153, SEQ ID NO: 154, SEQ ID NO: 155, SEQ ID NO: 156, SEQ ID NO: 157, 
SEQ ID NO: 158, SEQ ID NO: 159, SEQ ID NO: 160, SEQ ED NO: 161, SEQ ID NO: 162, 
SEQ ED NO:163, SEQ ED NO:164, SEQ ID NO:165. SEQ ID NO:166, SEQ ED NO:167, 
SEQ ED NO: 1 68, SEQ ED NO: 169, SEQ ED NO: 1 70, SEQ ID NO: 1 7 1 , SEQ ID NO: 1 72, 
SEQ ED NO:l 73, SEQ ID NO: 1 74, SEQ ID NO: 1 75, SEQ ID NO: 1 76, SEQ ID NO: 1 77, 
SEQ ID NO: 1 78, SEQ ID NO: 1 79, SEQ ED NO: 1 80, SEQ ID NO: 1 8 1 , SEQ ID NO: 1 82, 



WO 00/73469 



PCTAJS00/14842 



SEQ ID NO: 1 83, SEQ ED NO: 1 84, SEQ ID NO: 1 85, SEQ ID NO: 1 86, SEQ ID NO: 1 87, 
SEQ ID NO: 188, SEQ ID NO: 189, SEQ ID NO: 190, SEQ ID NO: 191, SEQ ID NO: 199, 
SEQ ID NO: 193, SEQ ED NO: 194, SEQ ID NO: 195, SEQ ED NO: 196, SEQ ID NO: 197, 
SEQ ID NO: 198, SEQ ED NO: 199, SEQ ED NO:200, SEQ ED NO:201, SEQ ED NO:202, 
5 SEQ ID NO:203, SEQ ED NO:204, SEQ ED NO:205, SEQ ED NO:206, SEQ ID NO:207, 

SEQ ID NO:208, SEQ ED NO:209, SEQ ED NO:21 0, SEQ ED NO:21 1 , SEQ ED NO:2 12, 
SEQ ED NO:213, SEQ ED NO:214, SEQ ED N0.215, SEQ ED NO:216, SEQ ID NO:217, 
SEQ ID NO:218, SEQ EDNO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ED NO:222, 
SEQ ED NO:223, SEQ ED NO:224, SEQ ED NO:225, SEQ ID NO:226, SEQ ID NO:227, 

10 SEQ ID NO:228, SEQ ED NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, 
SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237, 
SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241 , and SEQ ID 
NO:242 (Figure 1). In addition, the polypeptides of the invention include the domains of 
these polypeptides, including, but not limited to, the N-terminal, kinase/catalytic, and C- 

15 terminal domains. 

The characteristics of the protein kinase nucleic acid sequences of the invention are 
provided in Table 1 . The protein kinases fall into 1 0 known groups: AGC, CAMK, CKI, 
CMGC, dsPK, EEFK, LEMK, MLK, STE and TK. In addition, there are a significant 
number of protein kinases that do not belong to any of the known groups, and therefore 

20 presumably define new protein kinase groups. 

Additional characteristics are shown in, inter alia, the tables, namely Table 1, 
Table 2, Table 3 and Table 4, provided below. 

V. Antibodies, Hvbridomas, Methods of Use and Kits for Detection of Protein 

25 Kinases 

The present invention relates to an antibody having binding affinity to a kinase of 
the invention. The polypeptide may have an amino acid sequence selected from the group 
consisting of those set forth in SEQ ID NO: 122, SEQ ED NO: 123, SEQ ID NO: 124, SEQ 
ED NO: 125, SEQ ED NO: 1 26, SEQ ED NO: 1 27, SEQ ED NO: 1 28, SEQ ID NO: 1 29, SEQ 

30 ED NO: 130, SEQ ED NO: 131, SEQ ED NO: 132, SEQ ED NO: 133, SEQ ED NO:134, SEQ 
ED N0.135, SEQ ED NO:136, SEQ ED NO:137, SEQ ED NO:138, SEQ ED NO:139, SEQ 
ED NO:140, SEQ ED NO.T41, SEQ ED NO:142, SEQ ED NO:143, SEQ ID NO:144, SEQ 
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20 ID NO:240, SEQ ID NO:241 , and SEQ ID NO:242, or a functional derivative thereof, or 

at least 9 contiguous amino acids thereof (preferably, at least 20, 30, 35, or 40 or more 
contiguous amino acids thereof). Alternatively, the antibody may bind to a part of the 
polypeptide not provided in the sequences above, but that is present in the full-length 
sequence of the polypeptide and that is easily obtained using methods standard in the art. 

25 Further, the antibody may bind specifically to particular domains of one or more of the 
kinases of the invention, including, but not, limited to, the N-terminal, kinase/catalytic, or 
C-terminal domains. 

The present invention also relates to an antibody having specific binding affinity to 
a kinase or kinase domain of the invention. Such an antibody may be isolated by 
30 comparing its binding affinity to a kinase of the invention with its binding affinity to other 

polypeptides. Those that bind selectively to a kinase of the invention would be chosen for 
use in methods requiring a distinction between a kinase of the invention and other 
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polypeptides. Such methods could include, but should not be limited to, the analysis of 
altered kinase expression in tissue containing other polypeptides. 

The kinases of the present invention can be used in a variety of procedures and 
methods, such as for the generation of antibodies, for use in identifying pharmaceutical 
5 compositions, and for studying DNA/protein interaction. 

The kinases of the present invention can be used to produce antibodies or 
hybridomas. One skilled in the art will recognize that if an antibody is desired, such a 
peptide could be generated as described herein and used as an immunogen. The antibodies 
of the present invention include monoclonal and polyclonal antibodies, as well fragments 
10 of these antibodies, and humanized forms. Humanized forms of the antibodies of the 

present invention may be generated using one of the procedures known in the art such as 
chimerization or CDR grafting. 

The present invention also relates to a hybridoma that produces the above- 
described monoclonal antibody, or binding fragment thereof. A hybridoma is an 
15 immortalized cell line that is capable of secreting a specific monoclonal antibody. 

In genera], techniques for preparing monoclonal antibodies and hybridomas are 
well known in the art (Campbell, "Monoclonal Antibody Technology: Laboratory 
Techniques in Biochemistry and Molecular Biology," Elsevier Science Publishers, 
Amsterdam, The Netherlands, 1984; St. Groth et aL, J. Immunol. Methods 35:1-21, 1980). 
20 Any animal (mouse, rabbit, and the like) which is known to produce antibodies can be 

immunized with the selected polypeptide. Methods for immunization are well known in 
the art. Such methods include subcutaneous or intraperitoneal injection of the 
polypeptide. One skilled in the art will recognize that the amount of polypeptide used for 
immunization will vary based on the animal that is immunized, the antigenicity of the 
25 polypeptide and the site of injection. 

The polypeptide may be modified or administered in an adjuvant in order to 
increase the peptide antigenicity. Methods of increasing the antigenicity of a polypeptide 
are well known in the art. Such procedures include coupling the antigen with a 
heterologous protein (such as globulin or p-galactosidase) or through the inclusion of an 
30 adjuvant during immunization. 
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For monoclonal antibodies, spleen cells from the immunized animals are removed, 
fused with myeloma cells, such as SP2/0-Agl4 myeloma cells, and allowed to become 
monoclonal antibody producing hybridoma cells. Any one of a number of methods well 
known in the art can be used to identify the hybridoma cell that produces an antibody with 
5 the desired characteristics. These include screening the hybridomas with an ELISA assay, 

western blot analysis, or radioimmunoassay (Lutz et al, Exp. Cell Res. 175:109-124, 
1988). Hybridomas secreting the desired antibodies are cloned and the class and subclass 
are determined using procedures known in the art (Campbell, "Monoclonal Antibody 
Technology: Laboratory Techniques in Biochemistry and Molecular Biology", supra, 
10 1984). 

For polyclonal antibodies, antibody-containing antisera is isolated from the 
immunized animal and is screened for the presence of antibodies with the desired 
specificity using one of the above-described procedures. The above-described antibodies 
may be detectably labeled. Antibodies can be detectably labeled through the use of 

15 radioisotopes, affinity labels (such as biotin, avidin, and the like), enzymatic labels (such 

as horse radish peroxidase, alkaline phosphatase, and the like) fluorescent labels (such as 
FITC or rhodamine, and the like), paramagnetic atoms, and the like. Procedures for 
accomplishing such labeling are well-known in the art, for example, see Stemberger et al, 
J. Histochem. Cytochem. 18:315, 1970; Bayer et aL 9 Meth. Enzym. 62:308-, 1979; Engval 

20 et al, Immunol. 109:129-, 1972; Goding, J. Immunol. Meth. 13:215-, 1976. The labeled 

antibodies of the present invention can be used for in vitro, in vivo, and in situ assays to 
identify cells or tissues that express a specific peptide. 

The above-described antibodies may also be immobilized on a solid support. 
Examples of such solid supports include plastics such as polycarbonate, complex 

25 carbohydrates such as agarose and sepharose, acrylic resins and such as polyacrylamide 

and latex beads. Techniques for coupling antibodies to such solid supports are well known 
in the art (Weir et al., "Handbook of Experimental Immunology" 4th Ed., Blackwell 
Scientific Publications, Oxford, England, Chapter 1 0, 1986; Jacoby et al., Meth. Enzym. 
34, Academic Press, N.Y., 1974). The immobilized antibodies of the present invention 

30 can be used for in vitro, in vivo, and in situ assays as well as in immunochromotography. 
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Furthermore, one skilled in the art can readily adapt currently available procedures, 
as well as the techniques, methods and kits disclosed herein with regard to antibodies, to 
generate peptides capable of binding to a specific peptide sequence in order to generate 
rationally designed antipeptide peptides (Hurby et ai, "Application of Synthetic Peptides: 
5 Antisense Peptides", In Synthetic Peptides, A User's Guide, W.H. Freeman, NY, pp. 289- 

307, 1992; Kaspczak et ai, Biochemistry 28:9230-9238, 1989). 

Anti-peptide peptides can be generated by replacing the basic amino acid residues 
found in the peptide sequences of the kinases of the invention with acidic residues, while 
maintaining hydrophobic and uncharged polar groups. For example, lysine, arginine, 
10 and/or histidine residues are replaced with aspartic acid or glutamic acid and glutamic acid 

residues are replaced by lysine, arginine or histidine. 

The present invention also encompasses a method of detecting a kinase 
polypeptide in a sample, comprising: (a) contacting the sample with an above-described 
antibody, under conditions such that immunocomplexes form, and (b) detecting the 
1 5 presence of said antibody bound to the polypeptide. In detail, the methods comprise 

incubating a test sample with one or more of the antibodies of the present invention and 
assaying whether the antibody binds to the test sample. Altered levels of a kinase of the 
invention in a sample as compared to normal levels may indicate disease. 

Conditions for incubating an antibody with a test sample vary. Incubation 
20 conditions depend on the format employed in the assay, the detection methods employed, 

and the type and nature of the antibody used in the assay. One skilled in the art will 
recognize that any one of the commonly available immunological assay formats (such as 
radioimmunoassays, enzyme-linked immunosorbent assays, diffusion based Ouchterlony, 
or rocket immunofluorescent assays) can readily be adapted to employ the antibodies of 
25 the present invention. Examples of such assays can be found in Chard ("An Introduction 

to Radioimmunoassay and Related Techniques" Elsevier Science Publishers, Amsterdam, 
The Netherlands, 1986), Bullock et ah ("Techniques in Immunocytochemistry," Academic 
Press, Orlando, FL Vol. 1, 1982; Vol. 2, 1983; Vol. 3, 1985), Tijssen ("Practice and 
Theory of Enzyme Immunoassays: Laboratory Techniques in Biochemistry and Molecular 
30 Biology," Elsevier Science Publishers, Amsterdam, The Netherlands, 1 985). 
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The immunological assay test samples of the present invention include cells, 
protein or membrane extracts of cells, or biological fluids such as blood, serum, plasma, or 
urine. The test samples used in the above-described method will vary based on the assay 
format, nature of the detection method and the tissues, cells or extracts used as the sample 
5 to be assayed. Methods for preparing protein extracts or membrane extracts of cells are 
well known in the art and can be readily be adapted in order to obtain a sample which is 
testable with the system utilized. 

A kit contains all the necessary reagents to carry out the previously described 
methods of detection. The kit may comprise: (i) a first container means containing an 
10 above-described antibody, and (ii) second container means containing a conjugate 

comprising a binding partner of the antibody and a label. In another preferred 
embodiment, the kit further comprises one or more other containers comprising one or 
more of the following: wash reagents and reagents capable of detecting the presence of 
bound antibodies. 

15 Examples of detection reagents include, but are not limited to, labeled secondary 

antibodies, or in the alternative, if the primary antibody is labeled, the chromophoric, 
enzymatic, or antibody binding reagents that are capable of reacting with the labeled 
antibody. The compartmentalized kit may be as described above for nucleic acid probe 
kits. One skilled in the art will readily recognize that the antibodies described in the 

20 present invention can readily be incorporated into one of the established kit formats that 

are well known in the art. 

VI. Isolation of Compounds That Interact With Protein Kinases 

The present invention also relates to a method of detecting a compound capable of 
binding to a protein kinase of the invention, comprising incubating the compound with a 
25 kinase of the invention and detecting the presence of the compound bound to the kinase. 

The compound may be present within a complex mixture, for example, serum, body fluid, 
or cell extracts. 

The present invention also relates to a method of detecting an agonist or antagonist 
of kinase activity or kinase binding partner activity comprising incubating cells that 
30 produce a kinase of the invention in the presence of a compound and detecting changes in 
the level of kinase activity or kinase binding partner activity. The compounds thus 
identified would produce a change in activity indicative of the presence of the compound. 
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The compound may be present within a complex mixture, for example, serum, body fluid, 
or cell extracts. Once the compound is identified it can be isolated using techniques well 
known in the art. 

The present invention also encompasses a method of agonizing (stimulating) or 
5 antagonizing kinase associated activity in a mammal comprising administering to said 

mammal an agonist or antagonist to a kinase of the invention in an amount sufficient to 
effect said agonism or antagonism. A method of treating diseases in a mammal with an 
agonist or antagonist of kinase activity comprising administering the agonist or antagonist 
to a mammal in an amount sufficient to agonize or antagonize kinase associated functions 

10 is also encompassed in the present application. 

In an effort to discover novel treatments for diseases, biomedical researchers and 
chemists have designed, synthesized, and tested molecules that inhibit the function of 
protein kinases. Some small organic molecules form a class of compounds that modulate 
the function of protein kinases. Examples of molecules that have been reported to inhibit 

15 the function of protein kinases include, but are not limited to, bis monocyclic, bicyclic or 

heterocyclic aryl compounds (PCT WO 92/20642, published November 26, 1992 by 
Maguire et al), vinylene-azaindole derivatives (PCT WO 94/14808, published July 7, 
1994 by Ballinari et al\ l-cyclopropyl-4-pyridyl-quinolones (U.S. Patent No. 5,330,992), 
styryl compounds (U.S. Patent No. 5,217,999), styryl-substituted pyridyl compounds (U.S. 

20 Patent No. 5,302,606), certain quinazoline derivatives (EP Application No. 0 566 266 Al), 

seleoindoles and selenides (PCT WO 94/03427, published February 17, 1994 by Denny et 
al\ tricyclic polyhydroxylic compounds (PCT WO 92/21660, published December 10, 
1992 by Dow), and benzylphosphonic acid compounds (PCT WO 91/15495, published 
October 17, 1991 by Dow et al). 

25 Compounds that can traverse cell membranes and are resistant to acid hydrolysis 

are potentially advantageous as therapeutics as they can become highly bioavailable after 
being administered orally to patients. However, many of these protein kinase inhibitors 
only weakly inhibit the function of protein kinases. In addition, many inhibit a variety of 
protein kinases and will cause multiple side-effects as therapeutics for diseases. 

30 Some indolinone compounds, however, form classes of acid resistant and 

membrane permeable organic molecules. WO 96/22976 (published August 1 , 1 996 by 
Ballinari et al) describes hydrosoluble indolinone compounds that harbor tetralin, 
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naphthalene, quinoline, and indole substituents fused to the oxindole ring. These bicyclic 
substituents are in turn substituted with polar moieties including hydroxylated alkyl, 
phosphate, and ether moieties. U.S. Patent Application Serial Nos. 08/702,232, filed 
August 23, 1996, entitled "Indolinone Combinatorial Libraries and Related Products and 
5 Methods for the Treatment of Disease" by Tang et al (Lyon & Lyon Docket No. 22 1/1 87) 

and 08/485,323, filed June 7, 1995, entitled "Benzylidene-Z-Indoline Compounds for the 
Treatment of Disease" by Tang et al (Lyon & Lyon Docket No. 223/298) and 
International Patent Publication WO 96/22976, published August 1, 1996 by Ballinari et 
al, all of which are incorporated herein by reference in their entirety, including any 

10 drawings, describe indolinone chemical libraries of indolinone compounds harboring other 

bicyclic moieties as well as monocyclic moieties fused to the oxindole ring. Applications 
08/702,232, filed August 23, 1996, entitled "Indolinone Combinatorial Libraries and 
Related Products and Methods for the Treatment of Disease" by Tang et al (Lyon & Lyon 
Docket No. 221/187), 08/485,323, filed June 7, 1995, entitled "Benzylidene-Z-Indoline 

1 5 Compounds for the Treatment of Disease" by Tang et al (Lyon & Lyon Docket No. 

223/298), and WO 96/22976, published August 1, 1996 by Ballinari et al teach methods 
of indolinone synthesis, methods of testing the biological activity of indolinone 
compounds in cells, and inhibition patterns of indolinone derivatives. 

Other examples of substances capable of modulating kinase activity include, but 

20 are not limited to, tyrphostins, quinazolines, quinoxolines, and quinolines. The 

quinazolines, tyrphostins, quinolines, and quinoxolines referred to above include well 
known compounds such as those described in the literature. For example, representative 
publications describing quinazolines include Barker et al, EPO Publication No. 0 520 722 
Al; Jones et al, U.S. Patent No.4,447,608; Kabbe et al, U.S. Patent No. 4,757,072; Kaul 

25 and Vougioukas, U.S. Patent No. 5, 316,553; Kreighbaum and Comer, U.S. Patent No. 

4,343,940; Pegg and Wardleworth, EPO Publication No. 0 562 734 Al ; Barker et al, 
Proc. of Am. Assoc. for Cancer Research 32:327 (1991); Bertino, J.R., Cancer Research 
3:293-304 (1979); Bertino, J.R., Cancer Research 9(2 part l):293-304 (1979); Curtin et 
al , Br. J. Cancer 53:361-368 (1 986); Fernandes et al , Cancer Research 43 : 1 1 1 7- 1 1 23 

30 (1983); Ferris et al J. Org. Chem. 44(2):173-178; Fry et al, Science 265:1093-1095 

(1994); Jackman et al, Cancer Research 51:5579-5586 (1981); Jones et al J. Med. Chem. 
29(6):1 1 14-1 1 1 8; Lee and Skibo, Biochemistry 26(23):7355-7362 (1987); Lemus et al, J. 
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Ore. Chem. 54:3511-3518 (1989); Ley and Seng, Synthesis 1975:415-522 (1975); 
Maxwell et al, Magnetic Resonance in Medicine 17:189-196 (1991); Mini et al, Cancer 
Research 45:325-330 (1985); Phillips and Castle, J. Heterocyclic Chem. 1 7(1 9): 1 489-1 596 
(1980); Reece et al, Cancer Research 47(ll):2996-2999 (1977); Sculier et al, Cancer 
5 Immunol, and Immunother. 23:A65 (1986); Sikora et al, Cancer Letters 23:289-295 
(1984); Sikora et al, Analytical Biochem. 172:344-355 (1988); all of which are 
incorporated herein by reference in their entirety, including any drawings. 

Quinoxaline is described in Kaul and Vougioukas, U.S. Patent No. 5,316,553, 
incorporated herein by reference in its entirety, including any drawings. 

10 Quinolines are described in Dolle et al, J. Med. Chem. 37:2627-2629 (1994); 

MaGuire, J. Med. Chem. 37:2129-2131 (1994); Burke et al, J. Med. Chem. 36:425-432 
(1993); and Burke et al BioOrganic Med. Chem. Letters 2:1771-1774 (1992), all of which 
are incorporated by reference in their entirety, including any drawings. 

Tyrphostins are described in Allen et al, Clin. Exp. Immunol. 91:141-156 (1993); 

15 Anafi et al. Blood 82:12:3524-3529 (1993); Baker et al, J. Cell Sci. 102:543-555 (1992); 

Bilder et al, Amer. Physiol. Soc. pp. 6363-6143:C721-C730 (1991); Brunton et al, 
Proceedings of Amer. Assoc. Cancer Rsch. 33:558 (1992); Bryckaert et al, Experimental 
Cell Research 199:255-261 (1992); Dong et al. J. Leukocyte Biology 53:53-60 (1993): 
Dong et al, J. Immunol. 1 5 1 (5):27 1 7-2724 (1 993); Gazit et al , J. Med. Chem. 32:2344- 

20 2352 (1989); Gazit et al, " J. Med. Chem. 36:3556-3564 (1993); Kaur et al, Anti-Cancer 
Drugs 5 :2 1 3-222 ( 1 994); Kaur et al , King et al. , Biochem. J. 275 :4 1 3-4 1 8 ( 1 99 1 ); Kuo et 
al. Cancer Letters 74:197-202 (1993); Levitzki, A., The FASEB J. 6:3275-3282 (1992); 
Lyall et al, J. Biol. Chem. 264:14503-14509 (1989); Peterson et al, The Prostate 22:335- 
345 (1993); Pillemer et al, Int. J. Cancer 50:80-85 (1992); Posner et al. , Molecular 

25 Pharmacology 45:673-683 (1993); Rendu et al, Biol. Pharmacology 44(5):881-888 

(1992); Sauro and Thomas, Life Sciences 53:371-376 (1993); Sauro and Thomas, J. 
Pharm. and Experimental Therapeutics 267(3): 1 19-1 125 (1993); Wolbring el al.. ). Biol. 
Chem. 269(36):22470-22472 (1994); and Yoneda et al, Cancer Research 51 :4430-4435 
(1991); all of which are incorporated herein by reference in their entirety, including any 

30 drawings. 
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Other compounds that could be used as modulators include oxindolinones such as 
those described in U.S. patent application Serial No. 08/702,232 filed August 23, 1996, 
incorporated herein by reference in its entirety, including any drawings. 
VII. Biological Significance, Applications and Clinical Relevance of Novel Protein 
5 Kinases 

For each protein kinase in this application, we provide a classification of the 
protein class and family to which it belongs, a summary of non-cataltyic protein motifs, a 
profile of its expression in several hundred tissue and cell sources, and a chromosomal 
location. This information can be used to suggest potential function, regulation or 
10 therapeutic utility for each of the proteins. 

The kinase classification and protein domains often reflect pathways, cellular roles, 
or mechanisms of up- or down-stream regulation. Also disease-relevant genes often occur 
in families of related genes. For example if one member of a kinase family functions as an 
oncogene, a tumor suppressor, or has been found to be disrupted in an immune, 
15 neurologic, cardiovascular, or metabolic disorder, frequently other family members may 

play a related role. 

The expression analysis organizes kinases into groups that are transcriptionally 
upregulated in tumors and those that are more restricted to specific tumor types such as 
melanoma or prostate. This analysis also identifies genes that are regulated in a cell cycle 

20 dependent manner, and are therefore likely to be involved in maintaining cell cycle 

checkpoints, entry, progression, or exit from mitosis, oversee DNA repair, or are involved 
in cell proliferation and genome stability. Expression data also can identify genes 
expressed in endothelial sources or other tissues that suggest a role in angiogenesis, 
thereby implicating them as targets for control of diseases that have an angiogenic 

25 component, such as cancer, endometriosis, retinopathy and macular degeneration, and 

various ischemic or vascular pathologies. A proteins' role in cell survival can also be 
suggested based on restricted expression in cells subjected to external stress such as 
oxidative damage, hypoxia, drugs such as cisplatinum, or irradiation. Metastases- 
associated genes can be implicated when expression is restricted to invading regions of a 

30 tumor, or is only seen in local or distant metastases compared to the primary tumor, or 

when a gene is upregulated during cell culture models of invasion, migration, or motility. 
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Chromosomal location can identify candidate targets for a tumor amplicon or a 
tumor-suppressor locus. Summaries of prevelant tumor amplicons are available in the 
literature, and can identify tumor types to experimentally be confirmed to contain 
amplified copies of a kinase gene which localizes to an adjacent region. 
5 Based on these criteria several kinases immediately stand out as being of potential 

therapeutic relevance. The protein kinases can be divided into the following disease- 
relevant categories (nucleotide Seq ID #s in parentheses): 

Tumor associated: Mok (SEQ ID NO:NO:57), EPK2, AA3 16804 (SEQ ID 
NO: 11), AA435956 (SEQ ID NO:NO:48), AA278842 (SEQ ID NO:88), AA599286 (SEQ 
10 ID NO:89), AA826850 (SEQ ID NO:3), HRI (SEQ ID NO:73), MLK4 AA232253 (SEQ 

ID NO:82), AA883975 SGK 235 (SEQ ID NO:95), AA31 1714 (SEQ ID NO: 101), 
MPSK1 (SEQ ID NO: 110), R 19609 (Seq EDlll), AA383293 (SEQ ID NO:26). 

Prostate-specific: AA234451 (SEQ ID NO:47), TSK4 (SEQ ID NO:93), RIP4 
(SEQ ID NO:84), KIAA0965 (SEQ ID NO:8). 
15 Oncogenic or proliferation associated: KIAA0781 (SEQ ID NO:38), AA789239 

(SEQ ID NO:52), CORK (SEQ ID NO:54), CLK4 (SEQ ID NO:55), H85389 (SEQ ID 
NO:97). 

Neuronal restricted: CAMKKB (SEQ ID NO:66) 

Hematopoietic expressed: PTK9L (SEQ ID NO:22), DRAK2 (SEQ ID NO:29), 
20 AI025291 (SEQ ID NO:94) 

Angiogenic or endothelial expressed: DRAK1 (SEQ ID NO:31), MAK-V (SEQ 
ID NO:40), TRAD (SEQ ID NO:44), MOK (SEQ ID NO:57), AA08847 (SEQ ID NO:78), 
HGP_66444466 (SEQ ID NO:79), RSK4 (SEQ ID NO: 16). 

Cell cycle regulated: AA454060 (SEQ ID NO:45), KIAA0999 (Mitotic - SEQ ID 
25 NO:32), AA57964 1 (Mitotic - SEQ ID NO:60), AA305 1 76 (Mitotic - SEQ ID NO:6), 
AA018361 (SI phase -SEQ ID NO: 100). 
Vm. Transgenic Animals. 

A variety of methods are available for the production of transgenic animals 
associated with this invention. DNA can be injected into the pronucleus of a fertilized egg 
30 before fusion of the male and female pronuclei, or injected into the nucleus of an 

embryonic cell (e.g., the nucleus of a two-cell embryo) following the initiation of cell 
division (Brinstere/ ai, Proc. Nat. Acad. Sci. USA 82: 4438-4442, 1985). Embryos can 
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be infected with viruses, especially retroviruses, modified to carry inorganic-ion receptor 
nucleotide sequences of the invention. 

Pluripotent stem cells derived from the inner cell mass of the embryo and 
stabilized in culture can be manipulated in culture to incorporate nucleotide sequences of 
5 the invention. A transgenic animal can be produced from such cells through implantation 

into a blastocyst that is implanted into a foster mother and allowed to come to term. 
Animals suitable for transgenic experiments can be obtained from standard commercial 
sources such as Charles River (Wilmington, MA), Taconic (Germantown, NY), Harlan 
Sprague Dawley (Indianapolis, IN), etc. 

10 The procedures for manipulation of the rodent embryo and for microinjection of 

DNA into the pronucleus of the zygote are well known to those of ordinary skill in the art 
(Hogan et aL, supra). Microinjection procedures for fish, amphibian eggs and birds are 
detailed in Houdebine and Chourrout (Experientia 47: 897-905, 1991). Other procedures 
for introduction of DNA into tissues of animals are described in U.S. Patent No., 

15 4,945,050 (Sanford et ai 9 July 30, 1990). 

By way of example only, to prepare a transgenic mouse, female mice are induced 
to superovulate. Females are placed with males, and the mated females are sacrificed by 
CO2 asphyxiation or cervical dislocation and embryos are recovered from excised 
oviducts. Surrounding cumulus cells are removed. Pronuclear embryos are then washed 

20 and stored until the time of injection. Randomly cycling adult female mice are paired with 

vasectomized males. Recipient females are mated at the same time as donor females. 
Embryos then are transferred surgically. The procedure for generating transgenic rats is 
similar to that of mice (Hammer et ai, Cell 63:1099-1 112, 1990). 

Methods for the culturing of embryonic stem (ES) cells and the subsequent 

25 production of transgenic animals by the introduction of DNA into ES cells using methods 

such as electroporation, calcium phosphate/DNA precipitation and direct injection also are 
well known to those of ordinary skill in the art (Teratocarcinomas and Embryonic Stem 
Cells, A Practical Approach, E.J. Robertson, ed., IRL Press, 1987). 

In cases involving random gene integration, a clone containing the sequence(s) of 

30 the invention is co-transfected with a gene encoding resistance. Alternatively, the gene 
encoding neomycin resistance is physically linked to the sequence(s) of the invention. 
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Transfection and isolation of desired clones are carried out by any one of several methods 
well known to those of ordinary skill in the art (E J. Robertson, supra). 

DNA molecules introduced into ES cells can also be integrated into the 
chromosome through the process of homologous recombination (Capecchi, Science 244: 
5 1288-1292, 1989). Methods for positive selection of the recombination event (i.e., neo 

resistance) and dual positive-negative selection (i.e., neo resistance and gancyclovir 
resistance) and the subsequent identification of the desired clones by PCR have been 
described by Capecchi, supra and Joyner et al (Nature 338: 1 53-1 56, 1 989), the teachings 
of which are incorporated herein in their entirety including any drawings. The final phase 

10 of the procedure is to inject targeted ES cells into blastocysts and to transfer the 

blastocysts into pseudopregnant females. The resulting chimeric animals are bred and the 
offspring are analyzed by Southern blotting to identify individuals that carry the transgene. 
Procedures for the production of non-rodent mammals and other animals have been 
discussed by others (Houdebine and Chourrout, supra; Pursel et al, Science 244:1281- 

15 1288, 1989; and Simmse/a/., Bio/Technology 6:1 79-183, 1988). 

Thus, the invention provides transgenic, nonhuman mammals containing a 
transgene encoding a kinase of the invention or a gene effecting the expression of the 
kinase. Such transgenic nonhuman mammals are particularly useful as an in vivo test 
system for studying the effects of introduction of a kinase, or regulating the expression of 

20 a kinase (i.e., through the introduction of additional genes, antisense nucleic acids, or 

ribozymes). 

A "transgenic animal" is an animal having cells that contain DNA which has been 
artificially inserted into a cell, which DNA becomes part of the genome of the animal 
which develops from that cell. Preferred transgenic animals are primates, mice, rats, 
25 cows, pigs, horses, goats, sheep, dogs and cats. The transgenic DNA may encode human 

STE20-related kinases. Native expression in an animal may be reduced by providing an 
amount of anti-sense RNA or DNA effective to reduce expression of the receptor. 

IX. Gene Therapy 

30 Protein kinases of the invention, or their genetic sequences will also be useful in 

gene therapy (reviewed in Miller, Nature 357:455-460, 1992). Miller states that advances 
have resulted in practical approaches to human gene therapy that have demonstrated 
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positive initial results. The basic science of gene therapy is described in Mulligan 
(Science 260:926-931, 1993). 

In one preferred embodiment, an expression vector containing protein kinase 
coding sequence is inserted into cells, the cells are grown in vitro, and then are infused in 
5 large numbers into patients. In another preferred embodiment, a DNA segment containing 

a promoter of choice (for example a strong promoter) is transferred into cells containing 
an endogenous gene encoding kinases of the invention in such a manner that the promoter 
segment enhances expression of the endogenous kinase gene (for example, the promoter 
segment is transferred to the cell such that it becomes directly linked to the endogenous 

10 kinase gene). 

The gene therapy may involve the use of an adenovirus containing kinase cDNA 
targeted to a tumor, systemic kinase increase by implantation of engineered cells, injection 
with kinase-encoding virus, or injection of naked kinase DNA into appropriate tissues. 
Target cell populations may be modified by introducing altered forms of one or 

15 more components of the protein complexes in order to modulate the activity of such 

complexes. For example, by reducing or inhibiting a complex component activity within 
target cells, an abnormal signal transduction event(s) leading to a condition may be 
decreased, inhibited, or reversed. Deletion or missense mutants of a component, that 
retain the ability to interact with other components of the protein complexes but cannot 

20 function in signal transduction may be used to inhibit an abnormal, deleterious signal 

transduction event. 

Expression vectors derived from viruses such as retroviruses, vaccinia virus, 
adenovirus, adeno-associated virus, herpes viruses, several RNA viruses, or bovine 
papilloma virus, may be used for delivery of nucleotide sequences (e.g., cDNA) encoding 

25 recombinant kinase of the invention protein into the targeted cell population (e.g., tumor 

cells). Methods which are well known to those skilled in the art can be used to construct 
recombinant viral vectors containing coding sequences (Maniatis et al, Molecular 
Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, N.Y., 1989; Ausubel et 
al, Current Protocols in Molecular Biology, Greene Publishing Associates and Wiley 

30 Interscience, N.Y., 1989). Alternatively, recombinant nucleic acid molecules encoding 

protein sequences can be used as naked DNA or in a reconstituted system e.g., liposomes 
or other lipid systems for delivery to target cells (e.g., Feigner et al, Nature 337:387-8, 
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1989). Several other methods for the direct transfer of plasmid DNA into cells exist for 
use in human gene therapy and involve targeting the DNA to receptors on cells by 
complexing the plasmid DNA to proteins (Miller, supra). 

In its simplest form, gene transfer can be performed by simply injecting minute 
5 amounts of DNA into the nucleus of a cell, through a process of microinjection (Capecchi, 

Cell 22:479-88, 1980). Once recombinant genes are introduced into a cell, they can be 
recognized by the cell's norma] mechanisms for transcription and translation, and a gene 
product will be expressed. Other methods have also been attempted for introducing DNA 
into larger numbers of cells. These methods include: transfection, wherein DNA is 

10 precipitated with CaP0 4 and taken into cells by pinocytosis (Chen et ai, Mol. Cell Biol. 

7:2745-52, 1987); electroporation, wherein cells are exposed to large voltage pulses to 
introduce holes into the membrane (Chu et ai, Nucleic Acids Res. 15:131 1-26, 1987); 
lipofection/liposome fusion, wherein DNA is packaged into lipophilic vesicles which fuse 
with a target cell (Feigner et at., Proc. Natl. Acad. Sci. USA. 84:7413-7417, 1987); and 

1 5 particle bombardment using DNA bound to small projectiles (Yang et ai, Proc. Natl. 

Acad. Sci. 87:9568-9572, 1990). Another method for introducing DNA into cells is to 
couple the DNA to chemically modified proteins. 

It has also been shown that adenovirus proteins are capable of destabilizing 
endosomes and enhancing the uptake of DNA into cells. The admixture of adenovirus to 

20 solutions containing DNA complexes, or the binding of DNA to polylysine covalently 

attached to adenovirus using protein crosslinking agents substantially improves the uptake 
and expression of the recombinant gene (Curiel et ai, Am. J. Respir. Cell. Mol. Biol., 
6:247-52, 1992). 

As used herein "gene transfer" means the process of introducing a foreign nucleic 
25 acid molecule into a cell. Gene transfer is commonly performed to enable the expression 

of a particular product encoded by the gene. The product may include a protein, 
polypeptide, anti-sense DNA or RNA, or enzymatically active RNA. Gene transfer can be 
performed in cultured cells or by direct administration into animals. Generally gene 
transfer involves the process of nucleic acid contact with a target cell by non-specific or 
30 receptor mediated interactions, uptake of nucleic acid into the cell through the membrane 
or by endocytosis, and release of nucleic acid into the cytoplasm from the plasma 
membrane or endosome. Expression may require, in addition, movement of the nucleic 
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acid into the nucleus of the cell and binding to appropriate nuclear factors for 
transcription. 

As used herein "gene therapy" is a form of gene transfer and is included within the 
definition of gene transfer as used herein and specifically refers to gene transfer to express 
5 a therapeutic product from a cell in vivo or in vitro. Gene transfer can be performed ex 

vivo on cells which are then transplanted into a patient, or can be performed by direct 
administration of the nucleic acid or nucleic acid-protein complex into the patient. 

In another preferred embodiment, a vector having nucleic acid sequences encoding 
a protein kinase polypeptide of the invention is provided in which the nucleic acid 
10 sequence is expressed only in specific tissue. Methods of achieving tissue-specific gene 

expression are set forth in International Publication No. WO 93/09236, filed November 3, 
1992 and published May 13, 1993. 

In all of the preceding vectors set forth above, a further aspect of the invention is 
that the nucleic acid sequence contained in the vector may include additions, deletions or 
15 modifications to some or all of the sequence of the nucleic acid, as defined above. 

In another preferred embodiment, a method of gene replacement is set forth. 
"Gene replacement" as used herein means supplying a nucleic acid sequence which is 
capable of being expressed in vivo in an animal and thereby providing or augmenting the 
function of an endogenous gene that is missing or defective in the animal. 
20 X. Administration of Substances 

Methods of determining the dosages of compounds to be administered to a patient 
and modes of administering compounds to an organism are disclosed in U.S. Application 
Serial No. 08/702,282, filed August 23, 1996 and International patent publication number 
WO 96/22976, published August 1 1996, both of which are incorporated herein by 
25 reference in their entirety, including any drawings, figures, or tables. Those skilled in the 

art will appreciate that such descriptions are applicable to the present invention and can be 
easily adapted to it. 

The proper dosage depends on various factors such as the type of disease being 
treated, the particular composition being used, and the size and physiological condition of 
30 the patient. Therapeutically effective doses for the compounds described herein can be 
estimated initially from cell culture and animal models. For example, a dose can be 
formulated in animal models to achieve a circulating concentration range that initially 
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takes into account the IC50 as determined in cell culture assays. The animal model data 
can be used to more accurately determine useful doses in humans. 

Plasma half-life and biodistribution of the drug and metabolites in the plasma, 
tumors, and major organs can be also be determined to facilitate the selection of drugs 
5 most appropriate to inhibit a disorder. Such measurements can be carried out. For 

example, HPLC analysis can be performed on the plasma of animals treated with the drug 
and the location of radiolabeled compounds can be determined using detection methods 
such as X-ray, CAT scan, and MRI. Compounds that show potent inhibitory activity in 
the screening assays, but have poor pharmacokinetic characteristics, can be optimized by 

10 altering the chemical structure and retesting. In this regard, compounds displaying good 

pharmacokinetic characteristics can be used as a model. 

Toxicity studies can also be carried out by measuring the blood cell composition. 
For example, toxicity studies can be carried out in a suitable animal model as follows: 1) 
the compound is administered to mice (an untreated control mouse should also be used); 

15 2) blood samples are periodically obtained via the tail vein from one mouse in each 

treatment group; and 3) the samples are analyzed for red and white blood cell counts, 
blood cell composition, and the percent of lymphocytes versus polymorphonuclear cells. 
A comparison of results for each dosing regime with the controls indicates if toxicity is 
present. 

20 At the termination of each toxicity study, further studies can be carried out by 

sacrificing the animals (preferably, in accordance with the American Veterinary Medical 
Association guidelines Report of the American Veterinary Medical Assoc. Panel on 
Euthanasia, Journal of American Veterinary Medical Assoc., 202:229-249, 1993). 
Representative animals from each treatment group can then be examined by gross 

25 necropsy for immediate evidence of metastasis, unusual illness, or toxicity. Gross 

abnormalities in tissue are noted, and tissues are examined histologically. Compounds 
causing a reduction in body weight or blood components are less preferred, as are 
compounds having an adverse effect on major organs. In general, the greater the adverse 
effect the less preferred the compound. 
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For the treatment of cancers the expected daily dose of a hydrophobic 
pharmaceutical agent is between 1 to 500 mg/day, preferably 1 to 250 mg/day, and most 
preferably 1 to 50 mg/day. Drugs can be delivered less frequently provided plasma levels 
of the active moiety are sufficient to maintain therapeutic effectiveness. 

Plasma levels should reflect the potency of the drug. Generally, the more potent the 
compound the lower the plasma levels necessary to achieve efficacy. 

EXAMPLES 

The examples below are not limiting and are merely representative of various 
aspects and features of the present invention. The examples below demonstrate the 
isolation and characterization of the protein kinases of the invention. 

EXAMPLE 1: Isolation of cDNA clones Encoding Novel Mammalian Protein Kinases 
Materials and Methods Identification from cDNA databases and isolation of clones 
encoding novel protein kinases 

Novel kinases were identified from the public EST databases using a Hidden 
Markov model, abbreviated HMM (Kxogh, A., Brown, M., Mian, I. S., Sjolander, K., and 
Haussler, D. 1994. Hidden Markov models in computational biology: Applications to 
protein modeling. J. Mol Biol, 235:1501-1531). The model was built with 70 
mammalian and yeast kinase catalytic domain sequences. These sequences were chosen 
from a comprehensive collection of kinases such that no two sequences had more than 
50% sequence identity. ESTs were translated in six open reading frames and were 
searched against the model. ESTs that had a score of at least 10 against the HMM were 
then masked for repetitive sequences and vectors and were clustered using MSA. The 
resulting contigs were searched against known kinases to identify EST clones that encode 
novel kinases. 

Approximately 40% of the ESTs encoding potentially novel kinases did not 
correspond to the correct EST upon sequence analysis. Most of these discrepancies were 
resolved by ordering additional clones, however, 14 remained unavailable. These 14 ESTs 
were amplified from a variety of single-stranded cDNA sources with primers derived from 
the corresponding EST entry as shown on Table 5. The PCR product was subcloned into a 
bluescript vector, digested to confirm the presence of a correct size insert and sequenced. 
Full sequencing of EST and PCR was carried out using a cycle sequencing Big-dye kit 
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with AmpliTaq DNA Polymerase, FS (ABI, Foster City, CA). Sequencing reaction 
products were run on an ABI Prism 377 DNA Sequencer. 

Table 5: Primers used to clone PCR products corresponding to novel kinases 





ID# 


ID* 


Parent 


5' primer 


3' primer 


sp 


na 


aa 


Sequence 


Sequence* 


Sequence* 


H 


33 


153 


2R22-5-1 1 


GAGATCGRNTTYAARGA 
RTTYGA 


TGTCACNCCNAGNSWCCAN 
AYRTT 


M 


8] 


200 


5R57_10_2_ 
m TESK2_m 


GCTGCTGGACAGTGACT 
TGTATTT 


GAAAGCAAAGCCTTCACAC 
CTT 


H 


67 


187 


5R69J7_2_h 


CTCTCACCTCAGGAACT 
GG 


GCTTGCGGATCTTCTCA 


H 


46 


166 


SGK309_h 


GACATCCTGCCGGCCAA 
CTACG 


CGGCCCTGGAGCTGCATCA 
CTA 


M 


67 


228 


5R72_16_2_h 


TGCGCGACACCATTGAC 
CAG 


CTCAGGGCTTACATACAGA 
G 


H 


45 


165 


5R72_8_2_h 


AAAGGAGAACTACATTT 
TGAAAAT 


CTTCATCATCTCTAATACAT 
TGGTTGG 


H 


41 


161 


Z36720 


CAAATTAAGATCATTGA 
CTTTGGG 


GGAAACAAAGTCCTTGGCC 
TC 


H 


115 


234 


AL031652 - 
Pak6 


GTGG A C ATCTGGTCCCT 
CG 


GTAGGTCCTTCACTCTTGG 
AG 



• degenerate oligonucleotide residue designation: 
N= A,C,G ot T 



R= A or G 
Y= C or T 
S = C or G 
W= A or T 

Full-length sequence extension of protein kinases using cDNA and genomic databases 

Extension of partial cDNA sequences to encompass the full-length open-reading 
frame was carried out by iterative blastn searching of the cDNA databases listed in Table 
6. All blastn searches were conducted using a blosum62 matrix, a penalty for a nucleotide 
mismatch of -3 and reward for a nucleotide match of 1 . The gapped blast algortihm is 
described in: (Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui 
Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and 
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PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 
25:3389-3402). 

Table 6. Databases used for cDNA-based sequence extensions 



Database 




LifeGold templates 


Feb 2000 


LifeGold compseqs 


Feb 2000 


LifeGold compseqs 


Mar 2000 


LifeGold compseqs 


Apr 2000 


LifeGold fl 


Feb 2000 


LifeGold flft 


Apr 2000 


NCBI human Ests 


May 2000 


NCBI murine Ests 


May 2000 


NCBI nonredundant 


May 2000 



5 

Extension of partial cDNA sequences to encompass the full-length open-reading 
frame was also carried out by iterative searches of genomic databases. Three methods 
were used. The first method made use of the Smith-Waterman algorithm to carry out 
protein-protein searches of the closest homologue or orthologue to the partial kinase. The 

10 target databases consisted of Genescan and open-reading frame (ORF) predictions of all 

human genomic sequence derived from the human genome project (HGP) as well as from 
Celera. The complete set of genomic databases searched is shown in Table 7 below. 
Genomic sequences encoding potential extensions were further assessed by blastp analysis 
against the NCBI nonredundant to confirm the novelty of the hit. The extending genomic 

15 sequences were incorporated into the cDNA sequence after removal of potential introns 

using the Seqman program from DNAStar. The default parameters used for Smith- 
Waterman searches were as shown next. Matrix: blosum 62; gap-opening penalty: 12; gap 
extension penalty: 2. Genescan predictions were made using the Genescan program as 
detailed in (Chris Burge and Sam Karlin "Prediction of Complete Gene Structures in 

20 Human Genomic DNA", JMB (1997) 268(1 ):78-94). ORF predictions from genomic 
DNA were made using a standard 6-frame translation. 
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The second method for genomic sequence-based extensions made use of tBlastn 
searches of the homologue or orthologue to the partial kinase against the cDNA databases 
listed in Table 7. The recognition of significant hits in these databases made possible to 
identify bridging partial cDNA clones. The iterative application of the two methods made 
5 possible the assemblage of the virtual full-length sequence for a large number of the 

kinases presented in this application. All tblastn searches were conducted using a 
blosum62 matrix, a penalty for a nucleotide mismatch of -3 and reward for a nucleotide 
match of 1 . 

The last method for defining cDNA extensions from genomic sequence used 
1 0 iterative searches of genomic databases through the Genescan program to predict exon 

splicing and the Genewise program (http://www.sanger.ac.uk/SoftwareAVise2/ ) to predict 
potential ORFs based on homology to the closest orthologue/homologue. 
Table 7. Databases used for genomic-based sequence extensions 



Database 


Number of entries 


Database Date 


Celerav. 1-5 


5,306,158 


Jan 19/00 


Celera v. 6-10 


4,209,980 


Mar 24/00 


Celerav. 11-14 


7,222,425 


Apr 24/00 


Celera v. 1 5 


243,044 


May 14/00 


HGP all Genescan 


25,885 


Apr 04/00 


HGP; Phase 0 


4,944 


May 04/00 


HGP; Phase 1 


28,478 


May 05/00 


HGP; Phase 2 


1,508 


May 04/00 


HGP; Phase 3 


9,971 


May 05/00 



Virtual Extensions 

Human AA826850 (SEQ ID NO: 3, SEQ ID NO: 124) 
Blastn analysis of the partial AA826850 sequence revealed an extension to 
encompass the complete ORF in the Lncyte EST 238299.1 . A frame-shift correction at 
20 position 595 of this EST (marked by X in NA sequence) generated an uninterrupted ORF. 

Human AA960957 (SEQ ID NO: 4, SEQ ID NO: 125) 
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Since the initial filing of this application, the partial AA960957 sequence appeared 
in the public database as the full-length gene for a protein kinase encoded by a gene that 
maps adjacent to the eve (AJ250839) (ellis-van creveld syndrome and weyers acrodental 
dysostosis) gene from 4pl6.1. 

Human 5R79-46-l_h (SEQ ID NO: 5, SEQ ID NO:126) 
Blastn analysis of the partial 5R79-46-1 sequence revealed an extension to 
encompass the complete ORF in the Incyte EST 463894.6. Since the initial filing of this 
application, the full-length virtual 5R79-46-1 appeared in the public database as the full- 
length gene for the TANK-binding kinase (TBK1) (PomerantzJ.L. and Baltimore,D. 
(1999) EMBO J. 18 (23), 6694-6704). TBK1 participates in NF-kB activation through the 
formation of a signaling complex with TRAF2 and TANK. 

Human AA305176 (SEQ ID NO: 6, SEQ ID NO: 127) 
Blastn analysis of the partial AA3051 76 sequence revealed an extension to 
encompass the complete ORF in the Incyte EST 220937.1. 

Human AA256100 (SEQ ID NO: 8, SEQ ID NO: 129) 
Blastn analysis of the partial AA256100 sequence revealed an extension to 
encompass the complete ORF through the assembly of three partial clones: Incyte EST 
480815.6, K1AA0965 (BAA76809) and AA256100. 

Human AA2 10825 (SEQ ID NO: 9, SEQ ID NO: 130) 
Blastn analysis of the partial AA2 10825 sequence revealed an extension to 
encompass the nearly complete ORF through the assembly of three partial clones: Incyte 
EST 014721.7, and the NCB1 EST's AW01 158 and AA210825. An insertion of two "N's" 
at positions 1915 and 1916 generated an uninterrupted ORF. Blastx analysis indicated the 
possibility of a start Met in the range of 400-450 nucleotides (i.e. compared to the closest 
homolog, human PKCmu (CAA53384.1). However, no Met was found in this region; 
rather ORF ends in an in-frame stop preceeded by the sequence 

"RGLLAPGDPPCPPPNPAPATPPSSRLPTELFSNFCDS". It is possible that part of the 
sequence covered by nucleotide positions 1-400 derived from AW01 158 comes from an 
intron, explaining the absence of a start Met. 

Human AA127299 (SEQ ID NO: 10, SEQ ID NO:131) 

No entries in the database extended this sequence. The 1684 bp insert of this EST 
contains a 1369 bp intron at the 3' end. Blastx and SW analysis of the 315 bp coding 
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region revealed homology to the extracatalytic C2 domain of PKC. This EST, may or may 
not encode a kinase. 

Human AA3 16804 (SEQ ID NO:l 1, SEQ ED NO: 132) 

Since the initial filing of this application, the partial AA3 16804 sequence appeared 
5 in the public database as the full-length gene for the PKC family protein kinase EPK2 or 

PKCnu (AB015982). 

Human H19102 (SEQ CD NO:14, SEQ ID NO:135) 

Genewise and Genescan analyses of the partial HI 91 02 sequence revealed an 
extension from the HGP phase 3 contig 3810672 to encompass the complete catalytic 

10 domain of this EST. Blastn analysis against the non-redundant database revealed that this 

gene is found in the cosmid AC005726 from chromosome 17. H19102 may encode a dual 
catalytic kinase given the homology to S6 kinase. Analysis of genomic sequence upstream 
of the 5' end of HI 91 02 revealed a non-kinase gene oriented in the same polarity as 
H19102 suggestive of the start Met for H19102 being close to the 5' end of the H19102 

15 sequence. From this analysis it is deduced that the second catalytic domain of HI 91 02, if 

present, is most likely located within the 47334-1 85,21 5 bp region of the genomic 
sequence of AC005726. 

Human AA476563 (SEQ ID NO: 15, SEQ ED NO:136) 

Since the initial filing of this application, the partial AA476563 sequence appeared 
20 in the public database as the full-length gene for the protein kinase RPS6KC1 

(NMJH2424) (Zhang, H. et al Genomics (1999) 61, 314-318), which is an S6 kinase 
mapping to 12ql2-ql3.1. 

Human AA626690 (SEQ ID NO: 16, SEQ ID NO: 137) 

Since the initial filing of this application, the partial AA626690 sequence appeared 
25 in the public database as the full-length gene for the protein kinase RPS6KA6 (AF1 84965) 

(Yntema,H.G et al (1999) Genomics 62, 332-343), an S6 kinase commonly deleted in 
patients with complex X-linked (Xq21 .1 ) mental retardation. 
Human AI2 15680 (SEQ ID NO: 17, SEQ ID NO: 138) 

Since the initial filing of this application, the partial AI2 15680 sequence appeared 
30 in the public database as the full-length gene encoding a hypothetical protein (AAD301 82) 
from the locus AC006530.4 from chromosome 14. 

Human AA887783 (SEQ ID NO:21, SEQ ID NO:142) 
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Blastn analysis of the partial AA887783 sequence revealed an extension to 
encompass the nearly complete ORF through the assembly of three partial clones: Incyte 
415390R6 and the NCBI EST's AA887783 and N94726. Since the initial filing of this 
application, the nearly full-length virtual AA887783 sequence appeared in the public 
database as the full-length gene encoding SGK3 (AF1 69035), a serum- and 
glucocorticoid-induced protein kinase (Kobayashi,T. et a] (1999) Biochemical J. 344, 1 89- 
197. 

Human R47805 (SEQ ID NO:22, SEQ ID NO: 143) 

A cDNA clone encoding the full-length ORF of R47805 was isolated using 
R47805 as a screening probe. A full-length form for R47805 has also appeared in the 
public database as 

PTK9L (NM J)07284), an A6-related protein kinase. 

Human H60215 (SEQ ID NO:23, SEQ ID NO:144) 

Blastn analysis of the partial H60215 sequence revealed an extension to encompass 
the complete ORF in the public EST AI275726. This was confirmed through the full insert 
sequencing of this EST (2,310 bp) which corresponds to the sequence under SEQ ID 
NO: 144. 

A different stop codon was predicted for AI275726 compared to H60215 due to a 
single nucleotide insertion at position 1586 in A1275726. Evidence for the extra nucleotide 
comes from EST All 91 922. 

SGK324 h orthologue of W30246_m (SEQ ID NO:24 , SEQ ID NO: 145) 

Blastn, blastx and Smith- Waterman analyses of genomic databases revealed an 
extension to encompass the complete ORF corresponding to the human orthologue of 
murine W30246. Exons predicted from the following sequences were used for contig 
construction: Celera 17000189645083, 17000057549105 and 1 1000501939981; 
Incytel42404.1,HGP_7249119, Incyte 71 96489H1, Celera 1 1000501939981, 
17000028165594; Incyte 72491 19_3, Celera 17000035772368, 11000502081575 and 
17000140274329. The latter Celera sequence provides the N-terminus. 

Human AA383293 (SEQ ID NO:26, SEQ ID NO: 147) 

Blastn, blastx and Smith-Waterman analyses of genomic databases revealed an 
extension to encompass the complete ORF corresponding for AA383293. Exons predicted 
from the following sequences were used for contig construction: (numbers in parenthesis 
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refer to the aa sequence of the closest homolog (RU2S, NP_057440) used for the Smith- 
Waterman query): N-term from Incyte 6010175_2 (14-97), Incyte 6981981 (134-184) 
7596749 (186-232) Celera 17000020789545 (243-301) CAB75619.1 (310-341)-(56-145 
DCX homology) 6010175_2 , Celera 17000030058129 (241-262 DCX homology). 
5 Human AA02 1 445 (SEQ ED NO:32, SEQ ID NO: 1 52) 

Blastn analysis revealed an extension to encompass the nearly complete ORF 
corresponding for AA021445. Contig reconstruction was as follows: nucleotides 1-802 
from KIAA0999 (AB023216); nucleotides 803-4321 from full-insert sequence of 
AA021445. A pairwise alignment between the AA021445 and K1AA0999 revealed three 
10 inserts in the extracatalytic C-terminus of 48, 48 and 161 aminoacids. In addition, both 

AA021445 and KIAA0999 have 15 copies of a CAG repeat. Trinucleotide repeats are 
often found in genes that linked to neurodegenerative diseases. 
Human 2R22-55-1 (SEQ ID NO:33, SEQ ID NO: 153) 
Blastn analysis revealed an extension in the Incyte EST clone 321074.1 to 
1 5 encompass the complete ORF corresponding to 2R22-55-1 . 

Human orthologue of AA544838_m (SEQ ID NO:36, SEQ ID NO:156) 
tBlastn analysis identified the partial human K1AA0135 (U79240) clone as the 
human orthologue of murine AA544838. Blastn revealed an extension KlAA0135_h 
(U79240) to encompass the complete ORF. The full ORF was reconstructed from 
20 lncyte406786.5, KFZp430051 and KIAA0135 (U79240). 

Human orthologue of AI785735_m (SEQ ID NO:38, SEQ ED NO: 158) 
tBlastn analysis identified the partial human K1AA0781 (AB018324) clone as the 
human orthologue of murine A1785735. Blastn revealed an extension KIAA0135_h 
(U79240) to encompass the complete ORF. The full ORF was reconstructed from Incyte 
25 986123.37 KIAA0781 (AB018324). 

Human AA207220 (SEQ ID NO: 39, SEQ ID NO: 159) 
Blastn analysis revealed an extension to encompass the nearly complete ORF 
corresponding for AA021445. The full ORF was reconstructed from Incyte 402740.1 and 
AA207220. Frame corrections: deletion of 441 and 595 over Inc402740.1 seq based on 
30 blastx to keep frame open; two n insertions 940, 941 over AA207220 to keep frame open. 

Human AA426580 (SEQ ID NO:40, SEQ ID NO: 160) 
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Since the initial filing of this application, the partial AA426580 sequence appeared 
in the public database as the full-length gene encoding MAK-V (AJ271722) from 
chromosome 21q22.1. 

Human 5R79-54-1 (SEQ ID NO: 41, SEQ ID NO: 161) 
5 Genewise and Genescan analyses of the partial 5R79-54-1 sequence revealed an 

extension from genomic sequence to encode the full ORF for 5R79-54-1. 

Human orthologue of AA542015_m (SEQ ID NO: 42, SEQ ID NO: 162) 

tBlastn analysis identified KIAA1297 (AB037718). Blastn extended the 
K1AA1297 sequence to provide the C-terminus through the Incyte 224074.1 EST. The 
10 partial ORF consists of a dual catalytic domain flanked by 6 Ig domains and 2 fibronectin 

repeats. Based on homology to the bt drosophila protein (AAF59316.1), the human form 
of AA542015 is expected to be missing 16 Ig domains. 

Human Rl 9772 (SEQ ID NO:44, SEQ ID NO: 1 64) 

The full-length ORF for R19772 was isolated by screening a cDNA library using a 
15 probe derived from R 19772. Since the initial filing of this application, the R 19772 

sequence appeared in the public database as the full-length gene encoding Trio (Duet) 
(AB01 1422). CDNA library screening revealed multiple isoforms for this gene which are 
summarized in the Table below. 

20 TableS. Isoforms for R 19772 



Kestrl Name 


Kestrl 
AA Acc # 


Isoform 
type 


Source 


Description* 


Trad (Duet) 


R19772 


B 


Skeletal 
muscle 


Deletion of K at 124 










Deletion of Q at 616 










Substitution of E for G at 
762 






C 


Skeletal 
muscle 


Deletion of K at 124 










Deletion of Q at 616 










Substitution of E for G at 
762 
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Deletion of32aa (160-191) 






D 


Lung tumor 


Deletion of Q at 616 










Deletion of 32 aa (160-191) 






E 


Lung tumor 


Deletion of Q at 616 










Deletion of32aa (160-191) 



* reference amino acid position are with respect to sequence of Trad (ABO 1 1422) 



Human AA435956 (SEQ ID NO:48, SEQ ID NO: 168) 
5 Blastn analysis revealed an extension to encompass the nearly complete catalytic 

region of AA435956. 5' end sequence extension was provided by genomic locus 
AC007242.3_h (range 44880-43801). Based on blastx analysis, the extended sequence 
encodes is full-length at the C-terminus. 

Human AA397553 (SEQ ID NO: 51, SEQ ID NO: 171) 
10 Since the initial filing of this application, the partial AA397553 sequence appeared 

in the public database as the full-length gene encoding CRK7 (AF227198), a novel CDC2- 
related protein kinase that colocalizes with interchromatin granule clusters. 

Human AA789239 (SEQ ID NO: 52, SEQ ID NO:172) 

Since the initial filing of this application, the partial AA789239 sequence appeared 
1 5 in the public database as the full-length gene encoding NKIAMRE (AF1 30372), a novel 

kinase deleted in human leukemia. 

Human AA631990 (SEQ ID NO:55, SEQ ID NO:175) 
Blastn analysis revealed an extension to encompass the full-length ORE for 
AA631990. The full ORF was reconstructed from 253847.5 and AA631990 and 
20 AA207220. Frame corrections: delete 1 C at 1380, delete 2N's at 2033/2034. 
Human AA557536 (SEQ IDNO:56, SEQ ID NO: 176) 
Blastn analysis revealed an extension to encompass full-length ORF for 
AA557536. The full ORF was reconstructed from AA557536, celera 1 1000504061899 
and the Incyte 097089.1 EST. An 85bp intron was removed from AA557536. 
25 Human N34132 (SEQ ID NO: 63, SEQ ID NO: 183) 

Full sequencing of EST N34132 (1.3 kb) confirmed that this cDNA encodes a 
novel NEK-subfamily kinase. Blast analysis against the EST database showed that four 
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EST sequences (AA283140, AA283140, AA28291 1 and N5301 1) extended the sequence 
of N34132 at the 3' end to form a 231 kb contig. Blast analysis of the new contig against 
the nonredunat public database showed that the N34132 extended contig overlapped 
(100% identity) over 228 bp at its 3' end with human KJAA0344 (AB002342), a 5, 787 bp 
5 cDNA encoding a 1 246 aa polypeptide. The 5' 790 bp of the K1AA0344 cDNA (encoding 

the 58 N-terminal protein sequence) were found to be divergent with respect to the 
extended 2.32 kb N34132 contig. Evidence that the extended N34132 contig (2.3 Ikb) and 
KIAA0344 (AB002342) belong to the same gene is the following. First, blast analysis of 
the nucleotide sequences for N34132 and K1AA0344 against the NRN database confirmed 

10 that these cDNA's are transcribed from the same genomic locus defined by two 

overlapping BACs (AC004765 and AC004803) from chromosome 12pl3.3. Second, full 
sequence determination of a PCR fragment amplified from single-stranded cDNA 
confirmed the junction between the extended N34132 contig and KIAA0344_h 
(AB002342). The 462 PCR product was amplified with primers 

1 5 CTCCTC AACAG AC AGTGCAG (5 ' primer) and GACATTCTACTACTCGGTCTC (3 ' 

primer) designed from the N34132 extended contig and K1AA0344 sequences, 
respectively. The region of N34132 containing the start Met was isolated by PCR from a 
testis cDNA library (Clontech). 

Human 5R69-1 7-2 (SEQ ID NO:67, SEQ ID NO: 1 87) 

20 The full-length ORE for 5R69- 17-2 was isolated by screening a cDNA library 

using a probe derived from 5R69-17-2. 

Human H8581 1 (SEQ ID NO:68, SEQ ID NO: 188) 

Tblastn, Smith-Waterman and blastn analyses using cDNA databases revealed an 
extension to encompass full-length ORE for H8581 1 . The full ORF was reconstructed 
25 from IncyteESTs 202971.8, 034583.3 and 034583.1 and public ESTs H8581 1 and 
AI570599. 

Human R43524 (SEQ ID NO:73, SEQ ID NO: 192) 

Blastn analysis revealed an extension to encompass the complete catalytic region 
and the C-terminus of R43524. Since the initial filing of this application, the partial 
30 R43524 sequence appeared in the public database as the full-length gene encoding the 
heme-regulated initiation factor 2-alpha kinase (HRJ) (AF1 81071). 

Human AA088547 (SEQ ID NO:78, SEQ ID NO: 197) 
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Genewise and Genescan analyses of genomic databases revealed an extension to 
encompass the complete ORF for AA088547. 

Human orthologue of AA139478_m (SEQ ID NO:80, SEQ ID NO: 199) 
Tblastn identified the lncyte 21 1475.1 as the potential full-length human 
5 orthologue of murine AA 139478 

Human AA232253 (SEQ ID NO:82 5 SEQ ID NO:201) 

The full-length ORF for AA232253 was isolated by screening a cDNA library 
using a probe derived from AA232253. Since the initial filing of this application, the 
AA232253 sequence appeared in the public database as the full-length gene encoding SLK 
10 (AB01 1422). SLK is a stress-regulated mixed lineage kinase-like protein that activation 

of Rac and induction of apoptosis. cDNA library screening revealed multiple isoforms for 
this gene which are summarized in the Table below. 

Table 9. Isoforms for AA232253 

15 



Kestrl 
Name 


Kestrl AA 

Acc# 


Isoform 
type 


Description* 


MLK4 


AA232253 


MLK4 


Substitution of C for W at 346 






MLK4B 


Different Cterm (332-800); seq in MLK4B is as 
shown in * 



* C-terminus specific to MLK4B 

LPLAARMSEESYFESKTEESNSAEMSCQITATSNGEGHGMNPSLQAMMLMGFGD1 
FSMNKAGAVMHSGMQINMQAKQNSS 
20 KTTSKRRGKKVNMALGFSDFDLSEGDDDDDDDGEEEDNDMDNSE 

Human H97685 (SEQ ID NO:84, SEQ ID NO:203) 

Blastn analysis revealed an extension to encompass the full-length ORF for 
H97685. The full ORF was reconstructed from lncyte 474824.1 and the public ESTs 
25 H97685 andM62021. 

Human AI052250 (SEQ ID NO: 87, SEQ ID NO:206) 
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Blastn analysis revealed an extension to encompass the full-length ORF for 
A1052250. The full ORF was reconstructed from Incyte 396868.1, the public partial 
cDNA FLJ 10074 (minus intron) and the public ESTs and the public ESTs AI052250 and 
H97685, A1499220 and M62021. 
5 Human AA278842 (SEQ ID NO:88, SEQ ID NO:206) 

A nearly full-length cDNA (FL4F12) for AA278842 was isolated by screening a 
cDNA library using a probe derived from AA278842. A full-length virtual ORF was 
generated using FL4F12 and AA278842. 

Human AA599286 (SEQ ED NO:89, SEQ ID NO:208) 
1 0 Since the initial filing of this application, the partial AA599286 sequence appeared 

in the public database as a full-length ORF (AK000342). 

Human AA425725 (SEQ ID NO:90, SEQ ID NO:209) 

Since the initial filing of this application, the partial AA425725 sequence appeared 
in the public database as MSSK1, a serine kinase gene located from human chromosome 
15 Xq28. 

Human SGK022 orthologue of AA060026_m (SEQ ID NO:91, SEQ ID NO:210) 

Tblastn, Smith- Waterman and blastn analyses of cDNA and genomic databases 
databases revealed a potential human orthologue for murine AA060026. The full-length 
ORF for SGK022 was reconstructed from genomic locus AC022307. 
20 Human AA399669 (SEQ ID NO:93, SEQ ID NO:212) 

Blastn analysis revealed an extension to encompass the full-length ORF for 
AA399669. The full ORF was reconstructed as follows: sequence 1-1007 from 
AL136295.2; sequencel 008-23 19 from AA399669 and Incyte 428177.1. 

Human AA883975 (SEQ ID NO:95, SEQ ID NO:214) 
25 Genescan and Genewise analyses of the genomic databases revealed an extension 

for AA883975 to encompass the full-length ORF 

Human AA905446 (SEQ ID NO:96, SEQ ID NO:215) 

Tblastn, Smith-Waterman and blastn analyses of cDNA and genomic databases 
databases revealed an extension for AA905446 to encompass the full-length ORF. For the 
30 Smith-Waterman analysis murine STK22 ( NPJB3462) was used as the closest 

orthologue. Contig formation: range 162133-163687 from HGP_h 6921333_9; removed 
intron (146-893) predicted from blastx analysis. 
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Human H29974 (SEQ ID NO: 97 SEQ ID NO:216) 

Blastn analysis revealed an extension to encompass a complete catalytic ORF for 
AA3 99669. The nearly full-length ORF was reconstructed using Incyte 213829.1 and 
H29974. 

5 Human AA21531 1 (SEQ ID NO:99, SEQ ID NO:218) 

Blastn analysis revealed an extension to encompass the full-length ORF for 
AA21531. The full ORF was reconstructed from Incyte 067584.1, 022456.1, AA21531 1 
and the reverse complement of CPG_043208. 

Human AA018361 (SEQ ID NO:100, SEQ ID NO:219) 
10 The full-length ORF for AA018361 was isolated by screening a cDNA library 

using a probe derived from AA01 8361 . This yielded clone Sug4-30. Clone Sug4-30, like 
multiple, independent cDNA clones contained a 181bp intron. The existence of intron-less 
RNA's was confirmed by a PCR reaction that generated a product that upon sequence 
analysis skipped the intron region. The full-length virtual ORF for AA01 8361 was 
1 5 generated through a contig between AL1 1 7482 (seq 1 -367) and the sequence for clone 

Sug4-30. 

Human orthologue of AA396601_m (SEQ ID NO:106, SEQ ID NO:225) 
tBlastn and Smith-Waterman analyses of genomic sequence revealed an extension 

to encompass the full catalytic region for the human orthologue of AA396601. The ORF 
20 was reconstructed from Incyte 018653.9 (7261449H1, 6891740J1) and genomic sequence 

CPG_040010. 

Human orthologue of AA671275_m (SEQ ID NO: 108, SEQ ID NO:227) 
Since the initial filing of this application, a potential human orthologue for murine 
AA671275 appeared in the public database as the full-length ORF for vaccinia related 
25 kinase 3 (BAA90769). 

Human H05721 (SEQ ID NO: 1 1 1, SEQ ID NO:230) 

Genescan and Genewise analyses of genomic sequence revealed an extension to 
encompass the full-length ORF for H05721 . 

Human AI086865 (SEQ ID NO:l 12, SEQ ED NO:231) 
30 Genescan and Genewise analyses of genomic sequence revealed an extension to 

encompass the full-length ORF for AI086865. The fiill-length ORF was reconstructed 
from Celera 17000102901516, Incyte 243269.1 and public AL1 37753 1 . 
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Human AA836348 (SEQ ID NO: 1 1 3, SEQ ID NO:232) 

Genescan and Genewise analyses of genomic sequence revealed an extension to 
encompass the full-length ORF for AA836348. 

Human R86668 (SEQ ID NO:14, SEQ ID NO:233) 

The full-length ORF for R86668 was isolated by screening a cDNA library using a 
probe derived from R86668. Since the initial filing of this application, the R8668 sequence 
appeared in the public database as the full-length gene mitogen-activated protein kinase 
kinase kinase 6 (MAP3K6) (NMJXW67). 

Human 2R4 1-9-4 (SEQ ID NO: 16, SEQ ID NO:235) 

The full-length virtual ORF for 2R4 1-9-4 was generated using genomic sequence 
to provide the Nterminus for the partial ORF predicted from clone 2R41-9-4 

Table 10. Sequences deleted from the provisional patent due to duplication with other 
genes in the patent 



Prov. SEQ ID NO: (na) 


Prov. SEQ ID NO: (aa) 


160 


196 


213 


214 


215 


216 


122 


126 


119 


123 


148 


184 


4 


20 


7 


23 


205 


206 


14 


30 


15 


31 


35 


56 


42 


63 


51 


72 


44 


65 


77 


91 



WO 00/73469 



PCI7US00/14842 



120 



78 


92 


79 


93 


80 


94 


157 


193 



Results 

Table 1 documents the results from the analysis of the nucleic acid sequence data. 
From left to right the data presented is as follows. "Gene name" refers to the EST or PCR 
5 fragment that defined the novel kinase. "Species" refers to the organism the sequence was 

derived from. "ED#" refers to the nucleic acid and amino acid sequence ID number 
designation from this patent. "Kinase family "and "Kinase group' 1 refers to the protein 
kinase classification defined by sequence homology and based on previously established 
phylogenetic analysis [Hardie, G. and Hanks S. The Protein Kinase Book, Academic Press 

10 (1995) and Hunter T. and Plowman, G. Trends in Biochemical Sciences (1977) 22:18-22 

and Plowman G.D. et ah (1999) Proc. Natl. Acad. Sci. 96:13603-13610)]. "ORF Start", 
"ORF End", "ORF Length" refer to the open reading frame range and length as calculated 
by standard nucleic acid translation programs such as MapDraw (DNAStar). "DNA 
Repeats" refers to regions of low complexity sequence or repetitive elements such as Alu, 

15 LINE, SINE, and LTR sequences. The chromosomal location (CHR localization) for 37 

of the 1 10 novel protein kinases is shown on Table 1 (NA, not available). The methods 
for determining chromosomal position are outlined below, in Example 2. 

Table 2 documents the results from the analysis of the amino acid sequence data. 
From left to right the data presented is as follows. "Gene name" refers to the EST or PCR 

20 fragment that defined the novel kinase. "Species" refers to the organism the sequence was 

derived from. "ID#" refers to the nucleic acid and amino acid sequence ID number 
designation from this patent. "Kinase family "and "Kinase group" refers to the protein 
kinase classification defined by sequence homology and based on previously established 
phylogenetic analysis [Hardie, G. and Hanks S. The Protein Kinase Book, Academic Press 

25 (1995) and Hunter T. and Plowman, G. Trends in Biochemical Sciences (1977) 22:18-22 

and Plowman G.D. et al (1999) Proc. Natl. Acad. Sci. 96:13603-13610)]. "nraa Score", 
"ID match aa", "Identity", "Similar", "nraa Match Acc#", Description" refer to the data 
obtained using a Smith-Waterman search of the amino acid sequence against the non- 
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redundant protein database (Matrix: PamlOO; gap open/extension penalties 14/1). "Kinase 
Domain Start", "Kinase Domain End", "Profile Start" and "Profile End" refer to data 
obtained using a Hidden-Markov Model to define catalytic range boundaries. The profile 
has a length of 261 amino acids, corresponding to the complete protein kinase catalytic 
5 domain. Proteins in which the profile recognizes a full length catalytic domain have a 

"Profile Start" of 1 and a "Profile End" of 261 . The boundaries of the catalytic domain 
within the overall protein are noted in the "Kinase Domain Start" and "Kinase Domain 
End" columns. 

10 The following abbreviations were used for kinases: 



ASK 


Apoptosis signal-regulating kinase 


CaMK 


Ca2+/calmodulin-dependent protein kinase 


CCRK 


Cell cycle-related kinase 


CDK 


Cyclin-dependent kinase 


CK 


Casein kinase 


DAPK 


Death-associated protein kinase 


DM 


myotonic dystrophy kinase 


Dyrk 


dual-specificity-tyrosine phosphorylating-regulated kinase 


GAK 


Cyclin G-associated kinase 


GRK 


G-protein coupled receptor 


GuC 


Guanylate cyclase 


HIPK 


Homeodomain-interacting protein 


IRAK 


Interleukin-1 receptor- associated kin 


MAPK 


Mitogen activated protein kinase 


MAST 


Micotubule-associated STK 


MLCK 


Myosin-light chain kinase 


MLK 


Mixed lineage kinase 


NMA 


NimA-related protein kinase 


PKA 


cAMP-dependent protein kinase 


RSK 


Ribosomal protein S6 kinase 


RTK 


Receptor tyrosine kinase 
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SGK Serum and glucocorticoid-regulated kinase 
STK serine threonine kinase 
ULK UNC-5 1 -like kinase 



The following abbreviations were used for species 



H 


Human 


M 


Murine 


R 


Rat 


FV 


Fowlpox virus 


MT 


M. thermoautotrophicum 


CE 


Caenorhabditis elegans 


DM 


Drosophila melanogaster 


OS 


Oryza sativa 


SP 


Schizosaccharomyces pombe 


TP 


Tetrahymena pyriformis 


PI 


Petunia inflata 


NC 


Neurospora crassa 


MSV 


Medicago sativa 


MSV 


Moloney murine sarcoma virus 


SA 


Squalus acanthias 


CS 


Cucumis sativus 


GM 


Glycine max 


LL 


Lilium longiflorum 


TV 


Trichomonas vaginalis 


MP 


Mycoplasma pneumoniae 


DD 


Dictyostelium discoideum 


SC 


Saccharomyces cerevisiae 


MT 


M ethanobacteri urn thermoautotrophi cum 
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Domain and Motif Identification 

A Hidden Markov model (HMM) (Krogh, A., Brown, M., Mian, I. S., Sjolander, 

K., and Haussler, D. (1994). Hidden Markov models in computational biology: 

Applications to protein modeling. J. Mol. Biol., 235:1501-1531) was used to identify, 
5 both catalytic and extracatalytic domains. Table 4 shows extra-catalytic domains that 

were identified using the HMM program. Other domains such as coiled-coil and pest 

motifs were identified as described next. 

Potential coiled-coil domains were identified using the COILS program 

(www.ch.embnet.org/software/COILS_form.html). The matrix used was MTIDK with 
10 windows of 14, 21, 28 amino acids. Only regions scoring 0.5 or higher were considered to 

have potential coiled-coil domain region. 

Protein sequences containing potential pest motifs were identified using the 

program PESTfind (www.at.embnet.org/embnet/tools/bio/PESTfind/). PEST regions in 

proteins are by definition sequences that tend to be rich in proline, glutamic or aspartic 
15 acid, argininine and histidine; they have been associated with increased protein turnover 

rates (Rogers S. et al. (1986) Science 234, 364-368. The algorithm defines PEST 

sequences as hydrophilic stretches of amino acids greater than or equal to 12 residues in 

length. Such regions contain at least one P, one E or D and one S or T. They are flanked 

by lysine (K), arginine (R) or histidine (H) residues, but positively charged residues are 
20 disallowed within the PEST sequence. PESTfind produces a score ranging form about -50 

to +50. By definition, a score above zero denotes a possible PEST region; a value greater 

than +5 defines a high probability that there is a PEST domain. 

Identification of potential coiled-coil domains and PEST domains in N34132 

Potential coiled-coil domains were identified in N34132 (SEQ ID NO: 183) using 
25 the COILS program. Only regions scoring 0.5 or higher were considered to have potential 

coiled-coil domain region. The amino acid positions within N34231 scoring for potential 

coil-coil regions are shown below. 
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Table 1 1 coiled-coil domains predicted for N34132 



Coiled-coil Region 


Amino acid range 


Length (aa) 


1 


124-147 


24 


2 


437-451 


15 


3 


495-526 


32 


4 


1,723-1,749 


27 



Potential PEST domains were identified in N34132 using PESTfmd, a value 
greater than +5 defines a high probability that there is a PEST domain. The amino acid 
positions within N34132 scoring for potential PEST regions are shown below. 



Table 12 Potential Pest domains identified in N34132 



PEST Region 


Score 


Amino acid range 


Amino Acid Length 


1 


+ 4.91 


54-95 


42 


2 


+11.4 


537-570 


34 


3 


+31.08 


1293-1304 


12 


4 


+10.15 


1543-1565 


23 


5 


+ 6.17 


1698-1732 


35 



EXAMPLE 2, Chromosomal Localization of Novel Mammalian Protein Kinases 
Materials and Methods 

Several sources were used to find information about the chromosomal localization 
of each of the genes described in this patent. First, the accession number for the nucleic 
acid sequence was used to query the Unigene database. The site containing the Unigene 
search engine is: http://www.ncbi.nlm.nih.gov/UniGene/Hs.Home.html. Information on 
map position within the Unigene database is imported from several sources, including the 
Online Mendelian Inheritance in Man (OMIM, 

http://www.ncbi.nlm.nih.gov/Omim/searchomim.html), The Genome Database 
(http://gdb.infobiogen.fr/gdb/simpleSearch.html), and the Whitehead Institute human 
physical map (http://carbon.wi.mit.edu:8000/cgi-bin/contig/sts_info?database=release). 
For example, searching Unigene with W56561, an EST for a MAK-like kinase, the 
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following information is retrieved: Chr.14, D14S65-qTEL. The location of this gene on 
an "ideogram" of the cytogenetic map of chromosome 14 is also provided, showing that 
W56561 maps to the bottom of chromosome 14, between 14q31 and 14qTel. If Unigene 
has not mapped the EST, then the nucleic acid for the gene of interest is used as a query 
5 against databases, such as dbsts and htgs (described at 

http://www.ncbi.nlm.nih.gov/BLAST/blast_databases.html) containing sequences that 
have been mapped already. The nucleic acid sequence is searched using BLAST-2 at 
NCBI (http://www.ncbi.nlm.nih.gov/cgi-bin/BLAST/nph-newblast) and is used to query 
either dbsts or htgs. In addition to the Whitehead and GDB sites mentioned above, 

10 Stanford University maintains a useful site for chromosomal mapping from STS data 

(http://www-shgc.stanford.edii/RH/rhserverformnew.html). Matches in htgs are often 
resolved immediately because the genomic region hit is annotated in the htgs entry. If an 
exact match match is found (defined roughly as 99% identity over a region of about 100 
base pairs or longer, excluding any repetitive sequence), then the mapped position of the 

1 5 entry in the database is assigned to the original kinase query. Once a cytogenetic region 

has been identified by one of these approaches, disease association is established by 
searching OMIM (see above for URL) with the cytogenetic location. OMIM maintains a 
searchable catalog of cytogenetic map locations organized by disease. A thorough search 
of available literature for the cytogenetic region is alo made using Medline 

20 (http://www.ncbi.nlm.nih.gov/PubMed/medline.html). References for association of the 

mapped sites with chromosomal abnormalities found in human cancer can be found in: 
Knuutila, et al 9 Am J Pathol, 1998, 152:1107-1123. 



Results 

25 The chromosomal location for 37 of the 1 10 novel protein kinases is shown on 

Table 1. Three of the novel protein kinases were mapped to regions associated with 
cancer amplicons, as shown on this table. The regions were also cross-checked with the 
Mendelian Inheritance in Man database, which tracks genetic information for many human 
diseases, including cancer. References for association of the mapped sites with 

30 chromosomal abnormalities found in human cancer can be found in: Knuutila, et al., Am J 

Pathol, 1998, 152:1107-1 123. Association of these mapped regions with other diseases is 
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documented in the Online Mendelian Inheritance in Man (OMIM) 
(http://www.ncbi.nlm.nih.gov/htbin-post/Omirn). 

EXAMPLE 3: Generation of Specific Immunoreagents 

5 Materials and Methods 

Peptide sequences to extra-catalytic regions of novel kinases are chosen which are 
not homologous to other known kinases based on a Smith Waterman homology search 
against the non-redundant protein database and predicted to be antigenic based on the 
DNAStar Protean program. These peptides are conjugated to KLH using Glutaraldehyde. 

10 Rabbits are immunized with the KLH-peptide conjugates by four injections three 

weeks apart. The rabbits are bled ten and fourteen days following the third injection and 
bled out ten days after the fourth. The serum is checked against the peptide by EL1SA. 



Table 13. Peptides to be used as immunogens for raising antibodies 



Clone 
Name 


SEQ ID 
NO (aa) 


Peptide Sequence 


Amino Location 


AA8256850 


124 


KSRDNSRDSSQSEND 


339-353 






TEKLKRS QDLPREPLP 


372-386 






RGWRPYDIHS 


223-232 


5R79-46-1 


126 


FEGPRRNKEVMYK 


224-236 






KDDYNETVHKKTE 


451-463 






GTHPKDRNVEKLQ 


541-553 






EV SK YQE YTNELQET 


643-657 


AA256100 


129 


IDDTSNFDDFPESDI 


405-419 






TEPDYKSKDWVFL 


427-439 






EEKKLRRSQHARKET 


61-75 


AA210825 


130 


SNKDTLRKRHYWRLD 


507-521 






RHTTRKSSTTLRE 


488-500 






FQNNTTNRYYKEIPL 


528-542 






GKHRKTGRDVAVK 


668-680 






FPTKQESQLRNE 


687-698 
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AA3 16804 


132 


ESHVHQEPSKRJPS 


239-252 






HTKRKSSTMVKEGW 


409-422 






PSDLDVERDEEAVK 


375-388 






SPGQGKDHKDLSTS1 


543-557 


R47805 


143 


EPVGRWDQDYDRAVL 


44-58 






KPKGPGGKRGHKRLI 


325-339 






PTDVAQLPSRVPRDA 


219-233 


AA234451 


167 


DPFDWEKTGNDGSLT 


293-307 






HPRPQEKDVWEE 


374-385 






RENTDEVFPDEQLSD 


340-354 






RSEITQPDRDIPLVR 


427-441 


AA460132 


180 


LKSYSTSSKKARPVL 


222-236 






KKLDEVRLRGRKRSM 


237-251 






ETEKTA QGLSNLAKT 


131-145 


N34132 


183 


SGRRRRPTKSKGSKS 


1848-1862 






PGTAPSKPPLTKAPV 


1474-1488 






VDSDTQPKAPGIDD 


1365-1378 






AHSLDKTSHSSTTGL 


1253-1267 


5R69-17-2 


187 


GTTREKTDRVKST 


178-190 






HSEAPELHGKJRSSN 


138-152 






DETVTPPQFSIV 


87-98 






QYDVKSEIYS 


204-213 


AA278842 


206 


TVDPEKSVRDQAFKA 


515-529 






DSSTADRWDDEDWGS 


637-651 






SVSEDPTQLEEVEKD 


539-553 


AA836348 


232 


N APTKRPRS ST VTEA 


323-337 






LDSEEDYYTPQKVDV 


514-528 






GDKASYRQPKHVEKL 


409-423 
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EXAMPLE 4. Expression analysis of Novel Mammalian Protein Kinases 
GENE EXPRESSION ANALYSIS 
Tissue Arrays 

"cDNA libraries" derived from a variety of sources were immobilized onto nylon 
5 membranes and probed with 32P-labeled cDNA fragments derived from the gene(s) of 

interest. 

Total RNA or mRNA was used as template in a reverse transcription reaction to 
generate single-stranded cDNAs (ss cDNA) that were tagged with specific sequences at 
each end. An oligo dT primer containing a specific sequence (CDS: 

10 AAGCAGTGGTAACAACGCAGAGTACT30VN (V=A,G,C N=A,G,C,T)) anneals at 
the polyA track at the 3' end of the mRNA and the reverse transcriptase (MMLV 
RnaseH-) transcribes the antisense strand until it reaches the end of the RNA strand when 
it adds additional C residues. If a primer (SMII: 
AAGCAGTGGTAACAACGCAGAGTACGCGGG or ML2G: 

15 AAGTGGCAACAGAGATAACGCGTACGCGGG) ending with 3 Gs is added, it anneals 

to the added Cs and the MMLV recognizes the rest of the primer sequence as template and 
continues transcription. As a result, the synthesized cDNAs contain specific sequence tags 
at both the 5' and the 3' end. When the 5' and the 3' ends are tagged with the same 
sequence (CDS and SMII) it is referred to as "symmetric." When the 5' end is tagged 

20 with a different sequence than the 3' end (CDS and ML2G) is referred to as "asymmetric" 

A double-stranded "cDNA library " is then generated by PCR amplification using the 
3'PCR and ML2 primers (3' PCR: AAGCAGTGGTAACAACGCAGAGT and ML2: 
AAGTGGCAACAGAGATAACGCGT) that anneal to the added sequence tags. 

The amplified "cDNA libraries" were manually arrayed onto nylon membranes 

25 with a 384 pin replicator. The DNA was denatured by alkali treatment, neutralized and 
cross-linked by UV light. The arrays were pre-hybridized with Express Hyb (Clontech) 
and hybridized with 32P labeled probes generated by random hexamer priming of cDNA 
fragments corresponding to the genes of interest. After washing, the blots were exposed to 
phosphorimaging cassettes and the intensity of the signal was quantified. The amount of 

30 the DNA on the arrays was also quantified by treating non-denatured or denatured arrays 
with Syber Green I or Syber Green II respectively (1 : 100,000 in 50mM Tris, pH8.0) for 2 
minutes. After washing with 50mM Tris, pH8.0, the fluorescent emission was detected 
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with a phosphorimager (Molecular Dynamics) and quantified. The amount of the arrayed 
DNA was used to normalize the hybridization signal and the corrected values are tabulated 
in Table 3. 

5 Results 

The results of the microarray expression analysis of the protein kinases presented 
in this application is shown in Table 3. Data presentation from left to right is as follows: 
"Tissue": tissue type of the cDNA; "Tumor sym", indicates that the tissue is derived from 
a tumor, "sym" refers to the fact that the 5' and 3' primers used to make the sample are the 

10 same; "Normal Sym", indicates normal tissue was used to make the sample, with 

symmetric primers as described above; "Tumor lo", indicates that primary tumor tissue 
was used to make the cDNA; "Tumor cells", indicates that these cDNA samples were 
made from cultured tumor cells; "Normal", indicates that these samples are derived from 
normal tissue or cell lines; "Endos", indicates that these samples are derived from 

15 endothelium-related tissue sources; "p53" refers to the status, mutant or wild-type, of the 

p53 gene in the source samples. Normalized expression values are presented for each 
gene referred to by its SEQ ED# on the subsequent columns. Genes represented in 
expression Table 3 are: SEQ ID NO:3 (AA826850), SEQ ID NO:5 (TBK1), SEQ ED NO:6 
(AA305176), SEQ ID NO:8 (AA256100), SEQ ID NO:9 (CAB43292), SEQ ID NO:l 1 

20 (EPK2), SEQ ID NO:12 (PKNbeta), SEQ ID NO:14 (H19102), SEQ ID NO:16 (RSK4), 
SEQ ID NO: 17 (AAD30182), SEQ ID NO:20 (SGK2), SEQ ID NO:22 (PTK9L), SEQ ID 
NO:26 (AA383293), SEQ ID NO:29 (DRAK2), SEQ ID NO:31 (DRAK1), SEQ ID 
NO:032 (AA015726), SEQ ID NO:40 (MAK-V), SEQ ID NO:044 (TRAD), SEQ ID 
NO:044 (TRAD), SEQ ID NO:45 (AA454060), SEQ ID NO:47 (AA234451), SEQ ID 

25 NO:48 (AA436054), SEQ ID NO:49 (AA626859), SEQ ID NO:51 (KIAA0904), SEQ ID 

NO:52 (AA789239), SEQ ID NO:54 (CCRK), SEQ ID NO:55 (CLK4), SEQ ID NO:56 
(AA557536), SEQ ID NO:57 (W56561), SEQ ID NO:60 (AA579641), SEQ ID NO:63 
(NEK7), SEQ ID NO:66 (CAMKXB), SEQ ID NO:68 (HIPK2), SEQ ID NO:72 
(R19609), SEQ ID NO:73 (HRI), SEQ ID NO:78 (AA088547), SEQ ID NO:79 

30 (AA449542), SEQ ID NO:082a (MLK4), SEQ ID NO:82 (MLK4b), SEQ ID NO:84 
(RDM), SEQ ID NO:88 (AA278842), SEQ ID NO:89 (AA1 95964), SEQ ID NO:90 
(MSSK1), SEQ ID NO:93 (TSK4), SEQ ID NO:94 (AI025291), SEQ ID NO:95 
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(AA948538), SEQ ID NO:96 (AA905446), SEQ ID NO:97 (H85389), SEQ ID NO: 100 
(AA018361), SEQ ID NO: 101 (AA31 1714), SEQ ID NO:l 10 (AA452647), SEQ ID 
NO:l 1 1 (AA310219), SEQ ID NO: 1 12 (AI086865), SEQ ID NO:l 14 (MEKX6), and SEQ 
IDNO:116(SuRTK106). 

5 

EXAMPLE 5. Kinase assays for Erk, JNK1 and p38 MAP kinases 

293T cells were transiently transfected with HA- p38 or co-transfected with Flag- 
tagged wt MLK4A, kinase-dead MLK4A, wild-type MLK4B or kinase-dead MLK4B 
using Lipofectamine 2000 (Lifetech). Cells were lysed 36 hr post-transfection. Cell 

10 lysates normalized to contain equivalent amounts of HA-p38 were immunoprecipitated 

with anti-HA antibody (Mab HA-1 1, Babco). Immunoprecipitates were split in two 
portions, one portion was Western-blotted with anti- HA antibody and the other with a 
phospho-specific p38 antibody (Promega) to detect activated levels of p38. Activation of 
Erkl and Jnkl was measured similarly. (This example applies to AA232253 (SEQ ID 

15 NO:82,SEQIDNO:201).) 

Results : 

In transient assays wild-type MLK4A and MLK4B (but not kinase-inactive 
MLK4A(K45M) or MLK4B(K45M)) activate Erk, JNK1 and p38 MAP kinases. 
20 EXAMPLE 6. RAC1 guanine-exchange factor assay 

293T cells were transiently transfected with HA-Racl or co-transfected with Flag- 
tagged Duet C, Duet E, Dbl and HA-Tiam-1 . Cells were lysed 36 hour post-transfection. 
Cell lysates normalized to contain equivalent amounts of Racl were affinity precipitated 
with immobilized GST-PBD (p21 -binding domain of Pak3). Bound proteins were 
25 Western blotted and probed with anti-HA antibody to detect levels of activated Racl 

((This example applies to R199772 (Trad/Duet)(SEQ ID NO:44, SEQ ID NO:164).) 



Results : 

Duet C and Duet E both act as guanine nucleotide exchange factors on Racl . 



WO 00/73469 



PCT/US00/14842 



131 



CONCLUSION 

One skilled in the art would readily appreciate that the present invention is well 
adapted to carry out the objects and obtain the ends and advantages mentioned, as well as 
5 those inherent therein. The molecular complexes and the methods, procedures, treatments, 

molecules, specific compounds described herein are presently representative of preferred 
embodiments are exemplary and are not intended as limitations on the scope of the 
invention. Changes therein and other uses will occur to those skilled in the art which are 
encompassed within the spirit of the invention are defined by the scope of the claims. 

1 0 It will be readily apparent to one skilled in the art that varying substitutions and 

modifications may be made to the invention disclosed herein without departing from the 
scope and spirit of the invention. 

All patents and publications mentioned in the specification are indicative of the 
levels of those skilled in the art to which the invention pertains. 

15 The invention illustratively described herein suitably may be practiced in the 

absence of any element or elements, limitation or limitations which is not specifically 
disclosed herein. Thus, for example, in each instance herein any of the terms 
"comprising", "consisting essentially of ' and "consisting of may be replaced with either 
of the other two terms. The terms and expressions which have been employed are used as 

20 terms of description and not of limitation, and there is no intention that in the use of such 

terms and expressions of excluding any equivalents of the features shown and described or 
portions thereof, but it is recognized that various modifications are possible within the 
scope of the invention claimed. 

In particular, although some formulations described herein have been identified by 

25 the excipients added to the formulations, the invention is meant to also cover the final 

formulation formed by the combination of these excipients. Specifically, the invention 
includes formulations in which one to all of the added excipients undergo a reaction 
during formulation and are no longer present in the final formulation, or are present in 
modified forms. 

30 In addition, where features or aspects of the invention are described in terms of 

Markush groups, those skilled in the art will recognize that the invention is also thereby 
described in terms of any individual member or subgroup of members of the Markush 
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group. For example, if X is described as selected from the group consisting of bromine, 
chlorine, and iodine, claims for X being bromine and claims for X being bromine and 
chlorine are fully described. 

Other embodiments are within the following claims. 
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What is claimed is: 

CLAIMS 

1. An isolated, enriched, or purified nucleic acid molecule encoding a kinase 
polypeptide selected from the group consisting of SEQ ID NO: 122, SEQ ID NO: 123, SEQ 
5 ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID NO:127, SEQ ID NO:128, SEQ 
ID NO: 1 29, SEQ ID NO: 1 30, SEQ ID NO: 1 3 1 , SEQ ID NO: 1 32, SEQ ID NO: 1 33 , SEQ 
ID NO: 1 34, SEQ ID NO: 1 35, SEQ ID NO: 1 36, SEQ ID NO: 1 37, SEQ ID NO: 1 38, SEQ 
ID NO:139, SEQ ID NO:140, SEQ ID NO:141, SEQ ID NO:142, SEQ ID NO:143, SEQ 
ID NO: 144, SEQ ID NO: 145, SEQ ID NO: 146, SEQ ID NO: 147, SEQ ID NO: 148, SEQ 

1 0 ID NO: 149, SEQ ID NO: 1 50, SEQ ID NO: 1 5 1 , SEQ ID NO: 1 52, SEQ ID NO: 1 53, SEQ 

ID NO: 154, SEQ ID NO: 155, SEQ ID NO: 156, SEQ ID NO: 157, SEQ ID NO: 158, SEQ 
ID NO: 159, SEQ ID NO: 160, SEQ ID NO: 161, SEQ ID NO: 162, SEQ ID NO: 163, SEQ 
ID NO: 1 64, SEQ ID NO: 1 65 . SEQ ID NO: 1 66, SEQ ID NO: 1 67, SEQ ID NO: 1 68, SEQ 
ID NO: 169, SEQ ID NO: 170, SEQ ID NO: 171, SEQ ID NO: 172, SEQ ID NO: 173, SEQ 

1 5 ID NO: 1 74, SEQ ID NO: 1 75, SEQ ID NO: 1 76, SEQ ID NO: 1 77, SEQ ID NO: 1 78, SEQ 

ID NO: 1 79, SEQ ID NO: 1 80, SEQ ID NO: 1 8 1 , SEQ ID NO: 1 82, SEQ ID NO: 1 83, SEQ 
ID NO: 1 84, SEQ ID NO: 1 85, SEQ ID NO: 1 86, SEQ ID NO: 1 87, SEQ ID NO: 188, SEQ 
ID NO: 1 89, SEQ ID NO: 1 90, SEQ ID NO: 1 9 1 , SEQ ID NO: 1 99, SEQ ID NO: 1 93 , SEQ 
ID NO: 194, SEQ ID NO: 195, SEQ ID NO: 196, SEQ ID NO: 197, SEQ ID NO: 198, SEQ 

20 ID NO:199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ 

ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ 
ID NO:209, SEQ ID NO:210, SEQ ID NO:21 1, SEQ ID NO:212, SEQ ID NO:213, SEQ 
ID NO:214, SEQ ID NO:21 5, SEQ ID NO:21 6, SEQ ID NO:21 7, SEQ ID NO:21 8, SEQ 
ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ 

25 ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ 
ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID NO:233, SEQ 
ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ 
ID NO:239, SEQ ID NO:240, SEQ ID NO:241 , and SEQ ID NO:242. 
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2. The nucleic acid molecule of claim 1, wherein said nucleic acid molecule 
comprises a nucleotide sequence that: 

(a) encodes a polypeptide comprising the amino acid sequence set forth 
in SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID 



NO:126, 


ri'TA rT\ XTA. 1 0 1 

SEQ ID NU.l//, 


obU 


TF> 
ID 


XJA. 1 OR 
N\J. 1ZO, 


CCA 


TFl 
ID 


INU.lzV, 


CCA 

oHv 


ID 


1NU.1 Jv, 




TD 


NO:l3l, 


cca rn\ xta- 1 1 o 
SbQ LU NU. i jz, 


CCA 


ID 


xirv 1 77 

INVJ. 1 Jj, 


CCA 


TFl 
LU 


NvJ. 1 d4, 


CCA 

oJCU 


in 


lNW. 1 J J, 


SFO 


ID 


NO:l36, 


OT^/"V TT\ VIA. in 

SEQ ID NO:137, 


cc a 
dbQ 


tf\ 
ID 


XTA« 1 

NU. 1 Jo, 


bbQ 


TFv 
ID 


XT A. no 
NU. 


CPA 

obU 


ID 




CCA 


TFi 


NO:l4l, 


SEQ ED NO:142, 


CCA 


ID 


XTA. 1 /IT 

NU: 143, 


err a 
bbQ 


ID 


XT A. ~\ A A 

NU:144, 


CUA 

bbv 


ID 


XT A. I /1C 

NU. i^fj, 


ccn 




NO:l46, 


SEQ ID NO:147, 


SbQ 


ID 


XTA. 1 /I O 

NU. 14o, 


cc a 
bbQ 


ID 


XT/~\. 1 A C\ 

NU. 14V, 


CCA 

obQ 


ID 


XTA. 1 CA 

NU. 1 ->U, 


CCA 


TF> 
ID 


NO:l5l, 


SEQ ED NO: 152, 


SEQ 


ID 


xt a. i ci 
NU:1:>J, 


CCA 

bbQ 


T*T\ 
ID 


XT A- 1 C A 

NU.l j4, 


OCA 

obQ 


ID 


XTA. ICC 

NU. 1 jJj 


CPA 
o£y 




NO:l56, 


SEQ ID NO: 157, 


nrA 

SEQ 


ID 


XTA. ICQ 

NU: 1 jo, 


cea 
SEQ 


ID 


XTA- ICO 

NU. J jy, 


CCA 

bbU 


JD 


XTA. 1 /CA 

NvJ. 1 OU, 


CCA 


ID 


NO:l61, 


f>T""/"V T T — V \1A 1 

SEQ ID NO: 162, 


SEQ 


ID 


XT A. 1 

NU:loi, 


CCA 


ID 


XT A. 1 £L A 

NU:lo4, 


CCA 


TFl 
ID 


XTA. 1 /CC 

NU. J Oj. 


CCA 


IFl 


NO:l66, 


SEQ ID NO:167, 


SEQ 


ID 


NU:loo, 


bbQ 


ID 


\TA. 1 /CO 

NU:loy, 


CCA 

obQ 


7Fk 

ID 


XTA. 1 *7A 

JNU.J /U, 


ccn 


in 


NO:l7l, 


nr>A ) r\ \1A 1 

SEQ ID NO: 1 72, 


nrA 

SEQ 


ID 


NO: 173, 


CCA 

bbQ 


TTpv 
ID 


XT A. 1 HA 

NU.l /4, 


CCA 

bbQ 


TFl 
ID 


XTA. 1 7C 


CCA 

oilU 


TFl 


NO:l76, 


SEQ ID NO: 177, 


crA 

SEQ 


ID 


xta. no 

NU:1 /o, 


CCA 

bbQ 


ID 


xta. i in 
NU:1 Jy, 


CCA 

bbQ 


ID 


X7A. 1 OA 

1>U. 1 oU, 


CCA 




NO:l8l, 


TT\ VTA 1 O O 

SEQ ID NO: 1 82, 


CTA 

SEQ 


ID 


NU:loi, 


CCA 

bbQ 


ID 


XT A - 1 OA 


CCA 

bbU 


ID 


NU. 1 oj, 


CCA 


TFi 


NO:l86, 


SEQ ID NU:lo/, 


CCA 


TFl 

ID 


xta- l cc 
NU. 1 oo, 


CpA 


TFl 
ID 


NU. ioy, 


CCA 

oLy 




xjA. i on 


uEy 


TT> 

ij-/ 


NO:l9l, 


O V /^v TT*\ X"T/*V 1 C\C\ 

SEQ ID NU. ivy, 


oca 


ID 


XJA* 1 AO 

NU. iyj, 


CCA 


ID 




CCA 


TF> 


XJA' 1 QS 




ID 


xta. i n/c 

NU: lyo, 


oca tf> XTfY 1 Q7 
oty ID iNvj.iy/, 




rn 


IN V_/. 1 70 , 


SFO 


TD 


NO- 199 


SFO 


ID 


NO'200 


SEQ 


ID 


NO:201, 


SEQ ID NO:202, 


SEQ 


ID 


xta nm 

NU:203, 


SbQ 


ID 


NU:2U4, 




TFl 

ID 




ccn 


TFi 


NO:206, 


SEQ ID NO:207, 


SEQ 


ID 


NO:208, 


SEQ 


ID 


NO:209, 


SEQ 


ID 


NO:210, 


SEQ 


ID 


NO:211, 


SEQIDNO:212, 


SEQ 


ID 


NO:213, 


SEQ 


ID 


NO:214, 


SEQ 


ID 


NO:215, 


SEQ 


ID 


NO:216, 


SEQIDNO:217, 


SEQ 


ID 


NO:218, 


SEQ 


ID 


NO:219, 


SEQ 


ID 


NO:220, 


SEQ 


ID 


NO:221, 


SEQ ID NO:222, 


SEQ 


ID 


NO:223, 


SEQ 


ID 


NO:224, 


SEQ 


ID 


NO:225, 


SEQ 


ID 


NO:226, 


SEQ ID NO:227, 


SEQ 


ID 


NO:228, 


SEQ 


ID 


NO:229, 


SEQ 


ID 


NO:230, 


SEQ 


ID 


NO:231, 


SEQ ID NO:232, 


SEQ 


ID 


NO:233, 


SEQ 


ID 


NO:234, 


SEQ 


ID 


NO:235, 


SEQ 


ID 


NO:236, 


SEQ ID NO:237, 


SEQ 


ID 


NO:238, 


SEQ 


ID 


NO:239, 


SEQ 


ID 


NO:240, 


SEQ 


ID 


NO:241, 


and SEQ ID NO:242; 























(b) is the complement of the nucleotide sequence of (a); 
30 (c) hybridizes under highly stringent conditions to the nucleotide 

molecule of (a) and encodes a naturally occurring kinase polypeptide; 
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(d) encodes a kinase polypeptide having an amino acid sequence 
selected from the group consisting of SEQ ID NO: 122, SEQ ID NO: 123, SEQ ID NO: 124, 
SEQ ED NO:125, SEQ ID NO:126, SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, 



SEQ ID NO:130, SEQ ID NO:131 
SEQ ED NO:135, SEQ ED NO:136 ; 
SEQ ED NO:140, SEQ ED NO:141 
SEQ ED NO: 145, SEQ ED NO: 146 
SEQ ID NO:150, SEQ ED NO:151 
SEQ ED NO: 155, SEQ ED NO: 156 
SEQ ID NO: 160, SEQ ED NO: 161 
SEQ ED NO: 165. SEQ ID NO: 166 
SEQ ID NO: 170, SEQ ID NO: 171 
SEQ ED NO: 175, SEQ ED NO: 176 
SEQ ED NO: 180, SEQEDNO:181 
SEQ ED NO: 185, SEQ ED NO:186 
SEQ ED NO: 190, SEQ ID NO: 191 
SEQ ED NO: 195, SEQ ID NO: 196 
SEQ ED NO:200, SEQ ED NO:201 
SEQ ED NO:205, SEQ ED NO:206 
SEQ ID NO:2 1 0, SEQ ED NO:2 1 1 
SEQ ID NO:215, SEQ ED NO:216 
SEQ ED NO:220, SEQ ED NO:221 
SEQ ED NO:225, SEQ ED NO:226 
SEQ ID NO:230, SEQ ID NO:231 
SEQ ED NO:235, SEQ ED NO:236 
SEQ ID NO:240, SEQ ID NO:241 



SEQ ID NO:132, SEQ ID NO: 133, SEQ ID NO:134, 
SEQ ID NO: 137, SEQ ED NO: 138, SEQ ID NO: 1 39, 
SEQ ED NO:142, SEQ ED NO:143, SEQ ID NO:144, 
SEQ ID NO: 147, SEQ ID NO: 148, SEQ ED NO: 149, 
SEQ ED NO: 152, SEQ ED NO: 153, SEQ ID NO: 154, 
SEQ ID NO: 1 57, SEQ ED NO: 1 58, SEQ ID NO: 1 59, 
SEQ IDNO:162, SEQ ID NO: 163, SEQ ID NO: 164, 
SEQ ED NO: 167, SEQ ID NO: 168, SEQ ID NO: 169, 
SEQ ID NO: 1 72, SEQ ID NO: 1 73, SEQ ID NO: 1 74, 
SEQ ED NO: 1 77, SEQ ID NO: 1 78, SEQ ID NO: 1 79, 
SEQ ID NO: 182, SEQ ID NO: 1 83, SEQ ID NO:l 84, 
SEQ ID NO:l 87, SEQ ID NO:l 88, SEQ ID NO:l 89, 
SEQ ID NO: 1 99, SEQ ID NO: 1 93, SEQ ID NO: 1 94, 
SEQ ID NO: 1 97, SEQ ID NO: 1 98, SEQ ID NO: 1 99, 
SEQ ID NO:202, SEQ ED NO:203, SEQ ID NO:204, 
SEQ ID NO:207, SEQ ED NO:208, SEQ ED NO:209, 
SEQ ED NO:212, SEQ ED NO:213, SEQ ID NO:214, 
SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, 
SEQ ID NO:222, SEQ ED NO:223, SEQ ID NO:224, 
SEQ ID NO:227, SEQ ED NO:228, SEQ ID NO:229, 
SEQ ID NO:232, SEQ ED NO:233, SEQ ID NO:234, 
SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, 



and SEQ ID NO:242, except that it lacks one or more, 
but not all, of a domain selected from the group consisting of an N-terminal domain, a 
catalytic domain, a C-terminal domain, a coiled-coil structure region, a proline-rich region, 
a spacer region, an insert, and a C-terminal tail; 

(e) is the complement of the nucleotide sequence of (d); 
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(f) encodes a domain of an amino acid sequence selected from the 
group set forth in SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, 
SEQ ID NO: 1 26, SEQ ID NO: 127, SEQ ID NO: 1 28, SEQ ID NO: 1 29, SEQ ID NO: 1 30, 
SEQ ID NO: 131, SEQ ID NO: 132, SEQ ID NO: 133, SEQ ID NO: 134, SEQ ID NO: 135, 

5 SEQIDNO:136, SEQ ED NO: 137, SEQ ID NO: 138, SEQIDNO:139, SEQIDNO:140, 

SEQ ID NO: 141, SEQ ID NO: 142, SEQ ID NO: 143, SEQ ID NO: 144, SEQ ID NO: 145, 
SEQ ED NO:146, SEQ ID NO:147, SEQ ED NO:148, SEQ ID NO:149, SEQ ID NO:150, 
SEQ ED NO: 151, SEQ ID NO:152, SEQ ED NO: 153, SEQ ID NO: 154, SEQ ID NO: 155, 
SEQ ED NO: 156, SEQ ED NO:157, SEQ ID NO: 158, SEQ ID NO: 159, SEQ ID NO: 160, 

10 SEQ ED NO: 161, SEQ ED NO: 162, SEQ ED NO: 163, SEQ ED NO:l 64, SEQ ED NO: 165. 
SEQ ED NO: 1 66, SEQ ED NO: 1 67, SEQ ED NO: 1 68, SEQ ED NO: 1 69, SEQ ED NO: 1 70, 
SEQ ED NO: 1 7 1 , SEQ ED NO: 1 72, SEQ ED NO: 1 73, SEQ ED NO: 1 74, SEQ ED NO: 1 75, 
SEQ ED NO: 176, SEQ ED NO: 177, SEQ ED NO: 178, SEQ ED NO: 179, SEQ ED NO: 180, 
SEQ ED NO:181, SEQ ED NO:182, SEQ ED NO:183, SEQ ID NO:184, SEQ EDNO:185, 

15 SEQ ED NO: 186, SEQ ED NO: 187, SEQ ID NO: 188, SEQ ID NO:l 89, SEQ ID NO: 190, 

SEQ ED NO: 1 91 , SEQ ED NO: 1 99, SEQ ED NO: 1 93, SEQ ID NO: 1 94, SEQ ID NO: 1 95, 
SEQ ED NO: 1 96, SEQ ED NO.l 97, SEQ ED NO: 1 98, SEQ ED NO: 1 99, SEQ ID NO:200, 
SEQ ED NO:201 , SEQ ED NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, 
SEQ ED NO:206, SEQ ED NO:207, SEQ ID NO:208, SEQ ED NO:209, SEQ ED NO:210, 

20 SEQ ED NO:21 1, SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, 

SEQ ED NO:216, SEQ ED NO:217, SEQ ED NO:218, SEQ ID NO:219, SEQ ID NO:220, 
SEQ ED NO:221, SEQ ED NO:222, SEQ ED NO:223, SEQ ED NO:224, SEQ ED NO:225, 
SEQ ED NO:226, SEQ ED NO:227, SEQ ED NO:228, SEQ ED NO:229, SEQ ID NO:230, 
SEQ ED NO:231, SEQ ED NO:232, SEQ ED NO:233, SEQ ED NO:234, SEQ ID NO:235, 

25 SEQ ED NO:236, SEQ ED NO:237, SEQ ED NO:238, SEQ ED NO:239, SEQ ED NO:240, 

SEQ ED NO:241 , and SEQ ED NO:242, wherein said domain is selected from the group 
consisting of an N-terminal domain, a catalytic domain, a C-terminal domain, a coiled-coil 
structure region, a proline-rich region, a spacer region, an insert, and a C-terminal tail; 

(g) is the complement of the nucleotide sequence of (f); 

30 (h) encodes a polypeptide comprising an amino acid sequence selected 

from the group consisting of SEQ ID NO:122, SEQ ED NO:123, SEQ ID NO:124, SEQ ID 
NO:125, SEQ ED NO:126, SEQ ED NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID 
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NO:130, SEQ ID NO: 131, SEQ ID NO: 132, SEQ E) NO: 133, SEQ ID NO: 134, SEQ ID 
NO:135, SEQ ID NO:136, SEQ ID NO:137, SEQ ID NO:138, SEQ ED NO:139, SEQ ID 
NO:U0, SEQ ID NO:141, SEQ ID N0.142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID 
NO:145, SEQ ID NO: 146, SEQ ID NO: 147, SEQ ID NO: 148, SEQ ID NO: 149, SEQ ID 

5 NO: 150, SEQ ID NO: 151, SEQ ID NO: 152, SEQ ID NO: 153, SEQ ID NO: 154, SEQ ID 

NO: 1 55, SEQ ID NO: 1 56, SEQ ID NO: 1 57, SEQ ID NO: 1 58, SEQ ID NO: 1 59, SEQ ID 
NO: 160, SEQ ID NO: 161, SEQ ID NO: 162, SEQ ID NO: 163, SEQ ID NO: 164, SEQ ID 
NO: 165. SEQ ID NO: 166, SEQ ID NO: 167, SEQ ID NO: 168, SEQ ID NO: 169, SEQ ID 
NO: 170, SEQ ID NO: 171, SEQ ID NO: 172, SEQ ID NO: 173, SEQ ID NO: 174, SEQ ID 

10 NO: 1 75, SEQ ID NO: 1 76, SEQ ID NO: 1 77, SEQ ID NO: 1 78, SEQ ID NO: 1 79, SEQ ID 

NO: 1 80, SEQ ID NO: 1 81 , SEQ ID NO: 1 82, SEQ ID NO: 1 83, SEQ ID NO: 1 84, SEQ ID 
NO: 185, SEQ ID NO: 186, SEQ ID NO: 187, SEQ ID NO: 188, SEQ ID NO: 189, SEQ ID 
NO: 190, SEQ ID NO: 191, SEQ ID NO: 199, SEQ ID NO: 193, SEQ ID NO: 194, SEQ ID 
NO:195, SEQ ID NO:196, SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID 

15 NO:200, SEQ ID NO:201, SEQ ED NO:202, SEQ ED NO:203, SEQ ID NO:204, SEQ ID 
NO:205, SEQ ED NO:206, SEQ ED NO:207, SEQ ED NO:208, SEQ ED NO:209, SEQ ED 
NO:21 0, SEQ ID NO:21 1 , SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID 
NO:215, SEQ IDNO:216, SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID 
NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID 

20 NO:225, SEQ ID NO:226, SEQ ED NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID 
NO:230, SEQ ID NO:23 1 , SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID 
NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID 
NO:240, SEQ ID NO:241, and SEQ ID NO:242, except that it lacks one or more, but not 
all, of the domains selected from the group consisting of an N-terminal domain, a catalytic 

25 domain, a C-terminal domain, a spacer region, a proline-rich region, a coiled-coil structure 
region, and a C-terminal tail; or 

(i) is the complement of the nucleotide sequence of (h). 
3. The nucleic acid molecule of claim 1 , further comprising a vector or 
promoter effective to initiate transcription in a host cell. 
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4. The nucleic acid molecule of claim 1 , wherein said nucleic acid molecule is 
isolated, enriched, or purified from a mammal. 

5. The nucleic acid molecule of claim 4, wherein said mammal is a human. 

6. A nucleic acid probe for the detection of nucleic acid encoding a kinase 

5 polypeptide in a sample, wherein said polypeptide is selected from the group consisting of 
SEQ ID NO:122, SEQ ED NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, 
SEQ ID NO: 127, SEQ ED NO: 128, SEQ ID NO:129, SEQ ID NO: 130, SEQ ID NO:131, 
SEQ ID NO: 132, SEQ ID NO: 133, SEQ ID NO:134, SEQ ID NO: 135, SEQ ID NO: 136, 
SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141, 

10 SEQ ED NO: 1 42, SEQ ED NO: 1 43, SEQ ED NO: 1 44, SEQ ID NO: 1 45 , SEQ ID NO: 1 46; 
SEQ EDNO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151, 
SEQ ID NO: 1 52, SEQ ID NO: 1 53, SEQ ID NO: 1 54, SEQ ID NO: 1 55, SEQ ID NO: 1 56, 
SEQ ID NO: 157, SEQ ED NO: 158, SEQ ED NO: 159, SEQ ID NO: 160, SEQ ID NO: 161, 
SEQ ID NO: 1 62, SEQ ID NO: 1 63, SEQ ID NO: 1 64, SEQ ID NO: 1 65 . SEQ ID NO: 1 66, 

15 SEQ ID NO: 1 67, SEQ ID NO: 1 68, SEQ ID NO: 1 69, SEQ ID NO: 1 70, SEQ ID NO: 1 7 1 , 
SEQ ID NO: 172, SEQ ID NO: 173, SEQ ED NO: 174, SEQ ED NO: 175, SEQ ID NO: 176, 
SEQ ID NO: 1 77, SEQ ED NO: 1 78, SEQ ID NO: 1 79, SEQ ID NO: 1 80, SEQ ID NO: 1 8 1 , 
SEQ ID NO: 182, SEQ ED NO: 183, SEQ ID NO: 184, SEQ ID NO: 185, SEQ ID NO: 186, 
SEQ ID NO: 187, SEQ ID NO: 188, SEQ ID NO: 189, SEQ ID NO: 190, SEQ ID NO: 191, 

20 SEQ ED NO: 199, SEQ ED NO: 193, SEQ ID NO: 194, SEQ ID NO: 195, SEQ ID NO: 196, 

SEQ ED NO: 197, SEQ ED NO: 198, SEQ ID NO: 199, SEQ ID NO:200, SEQ DO NO:201, 
SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, 
SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ED NO:210, SEQ ID NO:21 1, 
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, 

25 SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, 

SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ED NO:225, SEQ ID NO:226, 
SEQ ID NO:227, SEQ ED NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, 
SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, 
SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241, 

30 and SEQ ID NO:242. 
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7. The probe of claim 6, wherein said polypeptide is a fragment of the protein 
encoded by an amino acid sequence selected from the group consisting of SEQ DD 
NO: 1 22, SEQ ID NO: 123, SEQ ID NO: 1 24, SEQ ID NO: 1 25 , SEQ ID NO: 1 26, SEQ ID 
NO: 127, SEQ ID NO: 128, SEQ ID NO: 129, SEQ ID NO: 130, SEQ ID NO: 131, SEQ ID 
5 NO: 132, SEQ ID NO: 133, SEQ ID NO: 134, SEQ ID NO: 135, SEQ ID NO: 136, SEQ ID 
NO:137, SEQIDNO:138, SEQIDNO:139, SEQ ID NO:140, SEQIDNO:141, SEQ ID 
NO: 1 42, SEQ ID NO: 1 43, SEQ ID NO: 1 44, SEQ ED NO: 1 45 , SEQ ID NO: 1 46, SEQ ID 
NO: 1 47, SEQ ID NO: 1 48, SEQ ID NO: 1 49, SEQ ID NO: 1 50, SEQ ID NO: 1 5 1 , SEQ ID 
NO: 1 52, SEQ ED NO: 1 53, SEQ ID NO: 1 54, SEQ ID NO: 1 55, SEQ ID NO: 1 56, SEQ ID 

10 NO: 1 57, SEQ ED NO: 1 58, SEQ ID NO: 1 59, SEQ DD NO: 1 60, SEQ ED NO: 1 61 , SEQ ID 
NO: 162, SEQ ED NO: 163, SEQ DD NO: 164, SEQ ED NO: 165. SEQ ID NO: 166, SEQ ID 
NO: 1 67, SEQ DD NO: 1 68, SEQ DD NO: 1 69, SEQ DD NO: 1 70, SEQ ED NO: 17 1 , SEQ ID 
NO: 172, SEQ ED NO: 173, SEQ DD NO: 174, SEQ DD NO: 175, SEQ ED NO: 176, SEQ ID 
NO: 1 77, SEQ DD NO: 1 78, SEQ DD NO: 1 79, SEQ DD NO: 1 80, SEQ DD NO: 1 8 1 , SEQ DD 

15 NO: 1 82, SEQ ID NO: 1 83, SEQ DD NO: 1 84, SEQ DD NO : 1 85, SEQ DD NO: 1 86, SEQ ID 
NO: 1 87, SEQ DD NO: 1 88, SEQ DD NO: 1 89, SEQ DD NO: 1 90, SEQ ID NO: 1 91 , SEQ ID 
NO: 1 99, SEQ ED NO: 1 93, SEQ DD NO: 1 94, SEQ ID NO: 1 95, SEQ ID NO: 1 96, SEQ ID 
NO: 197, SEQ DD NO: 198, SEQ ID NO: 199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID 
NO:202, SEQ ED NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID 

20 NO:207, SEQ DD NO:208, SEQ ID NO:209, SEQ ID NO:2 1 0, SEQ DD NO:2 1 1 , SEQ ID 
NO:212, SEQ ED NO:213, SEQ DD NO:214, SEQ DD NO:215, SEQ ID NO:216, SEQ ID 
NO:2 1 7, SEQ DD NO:2 1 8, SEQ DD NO:2 1 9, SEQ ID NO:220, SEQ ID NO:22 1 , SEQ ID 
N0.222, SEQ DD NO:223, SEQ DD NO:224, SEQ DD NO:225, SEQ ID NO:226, SEQ DD 
NO:227, SEQ ED NO:228, SEQ DD NO:229, SEQ DD NO:230, SEQ ED NO:231, SEQ DD 

25 NO:232, SEQ ED NO:233, SEQ ID NO:234, SEQ DD NO:235, SEQ DD NO:236, SEQ ID 
NO:237, SEQ DD NO:238, SEQ DD NO:239, SEQ DD NO:240, SEQ ED NO:241, and SEQ 
EDNO:242. 
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8. A recombinant cell comprising a nucleic acid molecule encoding a kinase 
polypeptide selected from the group consisting of SEQ ID NO: 122, SEQ ED NO: 123, SEQ 
ID NO: 1 24, SEQ ID NO: 1 25, SEQ ID NO: 1 26, SEQ ID NO: 1 27, SEQ ID NO: 1 28, SEQ 
ID NO: 129, SEQ ID NO: 1 30, SEQ ID NO: 1 3 1 , SEQ ID NO: 1 32, SEQ ID NO: 1 33, SEQ 
5 ID NO: 1 34, SEQ ID NO: 1 35, SEQ ID NO: 1 36, SEQ E) NO: 1 37, SEQ ID NO: 1 38, SEQ 
ID NO:139, SEQ ID NO:140, SEQ ID NO:141, SEQ ID NO:142, SEQ ID NO:143, SEQ 
ID NO: 1 44, SEQ ID NO: 1 45, SEQ ID NO: 1 46, SEQ ID NO: 1 47, SEQ ID NO: 1 48, SEQ 
ID NO: 149, SEQ ID NO: 150, SEQ ID NO:151, SEQ ID NO: 152, SEQ ID NO: 153, SEQ 
ID NO: 1 54, SEQ ID NO: 1 55, SEQ ID NO: 1 56, SEQ ID NO: 1 57, SEQ ID NO: 1 58, SEQ 

10 ID NO: 1 59, SEQ ID NO: 1 60, SEQ ID NO: 1 6 1 , SEQ ID NO: 1 62, SEQ ID NO: 1 63 , SEQ 
ID NO: 1 64, SEQ ID NO: 1 65. SEQ ID NO: 1 66, SEQ ID NO: 1 67, SEQ ID NO: 1 68, SEQ 
ID NO: 1 69, SEQ ID NO: 1 70, SEQ ID NO: 1 71 , SEQ ID NO: 1 72, SEQ ID NO: 1 73, SEQ 
ID NO: 1 74, SEQ ID NO: 1 75, SEQ ID NO: 1 76, SEQ ID NO: 1 77, SEQ ID NO: 1 78, SEQ 
ID NO: 179, SEQ ID NO: 180, SEQ ID NO:181, SEQ ID NO: 182, SEQ ID NO: 183, SEQ 

15 ID NO: 1 84, SEQ ED NO: 185, SEQ ID NO: 1 86, SEQ ID NO: 1 87, SEQ ID NO: 1 88, SEQ 

ID NO: 189, SEQ ID NO: 190, SEQ ID NO:191, SEQ ID NO: 199, SEQ ID NO: 193, SEQ 
ID NO: 194, SEQ ED NO: 195, SEQ ID NO: 196, SEQ ID NO: 197, SEQ ID NO: 198, SEQ 
ID NO: 199, SEQ ED NO:200, SEQ ED NO:201, SEQ ED NO:202, SEQ ID NO:203, SEQ 
ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ED NO:207, SEQ ED NO:208, SEQ 

20 ID NO209, SEQ ED NO:2 1 0, SEQ ED NO:2 1 1 , SEQ ED NO:2 1 2, SEQ ED NO:2 1 3 , SEQ 
ID NO:214, SEQ ED NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID NO:218, SEQ 
ID NO:219, SEQ ED NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ 
ED NO:224, SEQ ED NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ED NO:228, SEQ 
ED NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ED NO:232, SEQ ID NO:233, SEQ 

25 ED NO:234, SEQ ED NO:235, SEQ ID NO:236, SEQ ED NO:237, SEQ ED NO:238, SEQ 
ID NO:239, SEQ ED NO.240, SEQ ED NO:241, and SEQ ED NO:242. 
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9. The cell of claim 8, wherein said polypeptide is a fragment of a protein 
encoded by an amino acid sequence selected from the group consisting of SEQ ID 
NO: 122, SEQ ID NO: 123, SEQ ID NO: 124, SEQ ID NO: 125, SEQ ID NO: 126, SEQ ID 
NO: 127, SEQ ID NO: 128, SEQ ID NO: 129, SEQ ID NO: 130, SEQ ID NO: 131, SEQ ID 

5 NO: 132, SEQ ID NO: 133, SEQ ID NO: 134, SEQ ID NO: 135, SEQ ID NO: 136, SEQ ID 
NO: 1 37, SEQ ID NO: 138, SEQ ID NO: 139, SEQ ID NO: 1 40, SEQ ID NO: 1 4 1 , SEQ ID 
NO:142, SEQ IDNO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146, SEQ ID 
NO: 1 47, SEQ ID NO: 1 48, SEQ ID NO: 1 49, SEQ ID NO: 1 50, SEQ ID NO: 1 5 1 , SEQ ID 
NO: 152, SEQ ID NO: 153, SEQ ID NO: 154, SEQ ID NO: 155, SEQ ID NO: 156, SEQ ID 

0 NO: 1 57, SEQ ID NO: 1 58, SEQ ID NO: 159, SEQ ID NO: 1 60, SEQ ID NO:l 61 , SEQ ID 
NO: 162, SEQ ID NO: 163, SEQ ID NO: 164, SEQ ID NO: 165. SEQ ID NO: 166, SEQ ID 
NO: 1 67, SEQ ID NO: 1 68, SEQ ID NO: 1 69, SEQ E) NO: 1 70, SEQ ID NO: 1 7 1 , SEQ ID 
NO: 1 72, SEQ ID NO: 1 73, SEQ ID NO: 1 74, SEQ ID NO: 1 75, SEQ ID NO: 1 76, SEQ ID 
NO: 1 77, SEQ ID NO: 1 78, SEQ ID NO: 1 79, SEQ ID NO: 1 80, SEQ ID NO: 1 8 1 , SEQ ID 

5 NO: 1 82, SEQ ID NO: 1 83, SEQ ID NO: 1 84, SEQ ID NO: 1 85, SEQ ID NO: 1 86, SEQ ID 

NO: 187, SEQ ID NO: 188, SEQ ID NO: 189, SEQ ID NO: 190, SEQ ID NO: 191, SEQ ID 
NO: 199, SEQ ID NO: 193, SEQ ID NO: 194, SEQ ID NO: 195, SEQ ID NO: 196, SEQ ID 
NO: 197, SEQ IDNO:198, SEQ ID NO: 199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID 
NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID 

>0 NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID N0.21 1, SEQ ID 

NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID 
NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID 
NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID 
NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID 

15 NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ED NO:236, SEQ ID 
NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241, and SEQ 
ID NO:242. 
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10. An isolated, enriched, or purified kinase polypeptide selected from the 
group consisting of SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, 
SEQ ID NO:126, SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, 
SEQ ID NO: 131, SEQ ID NO: 132, SEQ ID NO:133, SEQ ID NO: 134, SEQ ID NO:135, 
SEQ ID NO: 1 36, SEQ ID NO: 1 37, SEQ ID NO: 1 38, SEQ ID NO: 1 39, SEQ ID NO: 1 40, 
SEQ ID NO:141, SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, 
SEQ ED NO: 146, SEQ ID NO: 147, SEQ ID NO: 148, SEQ ID NO: 149, SEQ ID NO: 150, 
SEQ ID NO: 151, SEQ ED NO: 152, SEQ ID NO: 153, SEQ ID NO: 154, SEQ ID NO: 155, 
SEQ ID NO: 156, SEQ ID NO: 157, SEQ ID NO: 158, SEQ ID NO: 159, SEQ ID NO: 160, 
SEQ ID NO: 161, SEQ ID NO: 162, SEQ ID NO: 163, SEQ ID NO: 164, SEQ ID NO: 165. 
SEQ ID NO: 1 66, SEQ ID NO: 1 67, SEQ ID NO: 1 68, SEQ ID NO: 1 69, SEQ ID NO: 1 70, 
SEQ ID NO: 171, SEQ ID NO: 172, SEQ ID NO: 173, SEQ ID NO: 174, SEQ ID NO: 175, 
SEQ ID NO: 1 76, SEQ ID NO: 1 77, SEQ ID NO: 1 78, SEQ ID NO: 1 79, SEQ ID NO: 1 80, 
SEQ ID NO: 181, SEQ ID NO: 182, SEQ ID NO: 183, SEQ ID NO: 184, SEQ ID NO: 1.85, 
SEQ ID NO: 1 86, SEQ ID NO: 1 87, SEQ ID NO:l 88, SEQ ID NO: 1 89, SEQ ID NO:l 90, 
SEQ ID NO: 1 91 , SEQ ID NO: 1 99, SEQ ID NO: 1 93, SEQ ID NO: 1 94, SEQ ID NO: 1 95, 
SEQ ID NO: 1 96, SEQ ID NO: 1 97, SEQ ID NO: 1 98, SEQ ID NO: 1 99, SEQ ID NO:200, 
SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, 
SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, 
SEQ ID NO:21 1, SEQ ID NO:212, SEQ ID NQ:213, SEQ ID NO:214, SEQ ID NO:215, 
SEQ ID NO:216, SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO.220, 
SEQ ID NO:221 , SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, 
SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, 
SEQ ID NO:231, SEQ ID N0.232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, 
SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ED NO:239, SEQ ID NO:240, 
SEQ ED NO:241, and SEQ ID NO:242. 
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11. The polypeptide of claim 10, wherein said polypeptide is a fragment of the 
protein encoded by an amino acid sequence selected from the group consisting of SEQ ID 





NO:122, 


SEQ 


ID NO: 123, 


SEQ 


ID 


NO:124, 


SEQ 


ID 


NO:125, 


SEQ 


ID 


NO:126, 


SEQ 


ID 




NO: 127, 


SEQ 


E)NO:128, 


SEQ 


ID 


NO: 129, 


SEQ 


ID 


NO: 130, 


SEQ 


ID 


NO:131, 


SEQ 


ID 
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NO:132, 


SEQ 


ID NO: 133, 


SEQ 


ID 


NO: 134, 


SEQ 


ID 
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SEQ 


ID 


NO:136, 


SEQ 


ID 




NO:137, 


SEQ 


ID NO: 138, 


SEQ 


ID 


NO: 139, 


SEQ 


ID 


NO: 140, 


SEQ 


ED 
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SEQ 


ID 




NO: 142, 


SEQ 


ID NO: 143, 


SEQ 


ID 


NO: 144, 


SEQ 


ID 
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SEQ 


ED 


NO: 146, 


SEQ 


ID 




NO: 147, 


SEQ 


ID NO: 148, 


SEQ 


ID 
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SEQ 


ID 


NO:150, 


SEQ 


ID 


NO:151, 


SEQ 


ID 




NO: 152, 


SEQ 


IDNO:153, 


SEQ 


ID 


NO: 154, 


SEQ 


ID 


NO:155, 


SEQ 


ID 


NO:156, 


SEQ 


ID 


10 


NO: 157, 


SEQ 


ID NO: 158, 


SEQ 


ID 


NO: 159, 


SEQ 


ID 


NO: 160, 


SEQ 


ID 


NO:161, 


SEQ 


ID 




NO: 162, 


SEQ 
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SEQ 


ID 


NO: 164, 


SEQ 


ID 
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SEQ 


ID 


NO: 166, 


SEQ 


ID 




NO: 167, 


SEQ 


ID NO: 168, 


SEQ 


ID 


NO: 169, 


SEQ 


ED 
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SEQ 


ID 


NO:171, 


SEQ 


ID 




NO: 172, 


SEQ 


IDNO:173, 


SEQ 


ID 


NO: 174, 


SEQ 


ID 


NO:175, 


SEQ 


ID 


NO: 176, 


SEQ 


ID 




NO: 177, 


SEQ 


ID NO: 178, 


SEQ 


ID 


NO: 179, 


SEQ 


ID 


NO: 180, 


SEQ 


ID 


NO:181, 


SEQ 


ID 


15 


NO: 182, 


SEQ 


ID NO: 183, 


SEQ 


ID 


NO: 184, 


SEQ 


ID 


NO: 185, 


SEQ 


ID 


NO: 186, 


SEQ 


ID 




NO: 187, 


SEQ 


ID NO: 188, 


SEQ 


ID 


NO:189, 


SEQ 


ED 


NO: 190, 


SEQ 


ID 


NO:191, 


SEQ 


ID 




NO: 199, 


SEQ 


EDNO:193, 


SEQ 


ID 


NO: 194, 


SEQ 


ED 


NO: 195, 


SEQ 


ID 


NO: 196, 


SEQ 


ID 




NO: 197, 


SEQ 


ID NO: 198, 


SEQ 


ID 


NO: 199, 


SEQ 


ID 


NO:200, 


SEQ 


ED 


NO:201, 


SEQ 


ID 




NO:202, 


SEQ 


ID NO:203, 


SEQ 


ID 


NO:204, 


SEQ 


ID 


NO:205, 


SEQ 


ED 


NO:206, 


SEQ 


ID 


20 


NO:207, 


SEQ 


ID NO:208, 


SEQ 


ID 


NO:209, 


SEQ 


ID 


NO:210, 


SEQ 


ED 


NO:211, 


SEQ 


ID 




NO:212, 


SEQ 


EDNO:213, 


SEQ 


ID 


NO:214, 


SEQ 


ID 


NO:215, 


SEQ 


ID 


NO:216, 


SEQ 


ID 




NO:217, 


SEQ 


IDNO:218, 


SEQ 


ID 


NO:219, 


SEQ 


ID 


NO:220, 


SEQ 


ID 


NO:221, 


SEQ 


ID 




NO:222, 


SEQ 


EDNO:223, 


SEQ 


ID 


NO:224, 


SEQ 


ID 


NO:225, 


SEQ 


ID 


NO:226, 


SEQ 


ID 




NO:227, 


SEQ 


IDNO:228, 


SEQ 


ID 


NO:229, 


SEQ 


ID 


NO:230, 


SEQ 


ID 


NO:231, 


SEQ 


ED 


25 


NO:232, 


SEQ 


IDNO:233, 


SEQ 


ID 


NO:234, 


SEQ 


ID 


NO:235, 


SEQ 


ID 


NO:236, 


SEQ 


ID 




NO:237, 


SEQ 


IDNO:238, 


SEQ 


ID 


NO:239, 


SEQ 


ID 


NO:240, 


SEQ 


ID 


NO:241, 


and SEQ 



ID NO:242. 

12. The polypeptide of claim 10, wherein said polypeptide comprises: 

(a) an amino acid sequence selected from the group consisting of SEQ 
30 ID NO:122, SEQ ED NO:123, SEQ ED NO:124, SEQ ID NO:125, SEQ ID N0.126, SEQ 

ED NO: 1 27, SEQ ED NO: 1 28, SEQ ED NO: 1 29, SEQ ID NO: 1 30, SEQ ID NO: 1 3 1 , SEQ 
ED NO: 132, SEQ ED NO: 133, SEQ ID NO: 134, SEQ ED NO: 135, SEQ ID NO: 136, SEQ 
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ID N0.137, SEQ ID NO: 138, SEQ ID NO:139, SEQ ID NO: 140, SEQ ID NO: 141, SEQ 
IDNO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146, SEQ 
ID NO:147, SEQ ID NO: 148, SEQ ID NO: 149, SEQ ID NO: 150, SEQ ID NO:151, SEQ 
ID NO: 152, SEQ ID NO: 153, SEQ ID NO: 154, SEQ ID NO: 155, SEQ ID NO: 156, SEQ 
ID NO: 157, SEQ ID NO: 158, SEQ ID NO: 159, SEQ ID NO: 160, SEQ ID NO: 161, SEQ 
ID NO: 1 62, SEQ ID NO: 1 63, SEQ ID NO: 1 64, SEQ ID NO: 1 65. SEQ ID NO: 1 66, SEQ 
ID NO: 1 67, SEQ ID NO: 1 68, SEQ ID NO: 1 69, SEQ ID NO: 1 70, SEQ ID NO: 1 7 1 , SEQ 
ID NO: 172, SEQ ID NO: 173, SEQ ID NO: 174, SEQ ID NO: 175, SEQ ID NO: 176, SEQ 
ID NO: 1 77, SEQ ID NO: 1 78, SEQ ID NO: 1 79, SEQ ID NO: 1 80, SEQ ID NO: 1 8 1 , SEQ 
ID NO: 1 82, SEQ ID NO: 1 83, SEQ ID NO: 1 84, SEQ ID NO: 1 85, SEQ ID NO: 1 86, SEQ 
ID NO: 1 87, SEQ ID NO: 1 88, SEQ ID NO: 1 89, SEQ ID NO: 1 90, SEQ ID NO: 1 91 , SEQ 
ID NO:199, SEQ ID NO:193, SEQ ID NO: 194, SEQ ID NO: 195, SEQ ID NO: 196, SEQ 
ID NO: 1 97, SEQ ID NO: 1 98, SEQ ID NO: 1 99, SEQ ID NO:200, SEQ ID NO:201 , SEQ 
ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ 
ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:21 0, SEQ ID NO:21 1 , SEQ 
ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ 
ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ 
ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ 
ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:23 1 , SEQ 
ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ 
ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241 , and 
SEQ ID NO:242; 

(b) an amino acid sequence selected from the group consisting of SEQ 
ID NO: 122, SEQ ID NO: 123, SEQ ID NO: 124, SEQ ID NO: 125, SEQ ID NO: 126, SEQ 
ID NO: 127, SEQ ID NO: 128, SEQ ID NO: 129, SEQ ID NO: 130, SEQ ID NO: 131, SEQ 
E) NO: 132, SEQ ID NO: 133, SEQ ID NO: 134, SEQ ID NO: 135, SEQ ID NO: 136, SEQ 
ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141, SEQ 
ID NO: 1 42, SEQ ED NO: 1 43, SEQ ID NO: 1 44, SEQ ID NO: 1 45, SEQ ID NO: 1 46, SEQ 
ID NO:l 47, SEQ ID NO: 1 48, SEQ ID NO: 1 49, SEQ ID NO: 1 50, SEQ ID NO: 1 5 1 , SEQ 
ID NO: 1 52, SEQ ID NO: 1 53, SEQ ID NO: 1 54, SEQ ID NO: 1 55, SEQ ID NO: 1 56, SEQ 
ID NO: 1 57, SEQ ID NO: 1 58, SEQ ID NO: 1 59, SEQ ID NO: 1 60, SEQ ID NO: 1 61 , SEQ 
ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165. SEQ ID NO:166, SEQ 
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ID NO: 167, SEQ ID NO: 168, SEQ ED NO:169, SEQ ID NO: 170, SEQ ID NO: 171, SEQ 
ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176, SEQ 
ID NO: 177, SEQ ID NO: 178, SEQ ID NO: 179, SEQ ID NO: 180, SEQ ID NO:181, SEQ 
ID NO: 182, SEQ ID NO: 183, SEQ ID NO: 184, SEQ ID NO: 185, SEQ ID NO: 186, SEQ 

5 ID NO: 187, SEQ ID NO: 188, SEQ ID NO: 189, SEQ ID NO: 190, SEQ ID NO: 191, SEQ 
ID NO: 199, SEQ ID NO: 193, SEQ ID NO: 194, SEQ ID NO: 195, SEQ ID NO: 196, SEQ 
ID NO: 197, SEQ ID NO: 198, SEQ ID NO: 199, SEQ ID NO:200, SEQ ID NO:201, SEQ 
ID NO:202, SEQ ID NO:203, SEQ ED NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ 
ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:2 1 0, SEQ ID NO:2 1 1 , SEQ 

10 ID NO:212, SEQ ED NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ 

ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ 
ED NO:222, SEQ ED NO:223, SEQ ED NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ 
ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ED NO:23 1 , SEQ 
ID NO:232, SEQ ID NO:233, SEQ ED NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ 

15 ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241, and 

SEQ ID NO:242, except that it lacks one or more, but not all of the domains selected from 
the group consisting of an N-terminal domain, a catalytic domain, a C-terminal domain, a 
spacer region, a proline-rich region, a coiled-coil structure region, and a C-terminal tail 
(c) a domain of an amino acid sequence selected from the group set 

20 forth in SEQ ED NO: 122, SEQ ID NO: 123, SEQ ID NO: 124, SEQ ID NO: 125, SEQ ID 

NO: 126, SEQ ID NO: 127, SEQ ID NO: 128, SEQ ID NO: 129, SEQ ID NO: 130, SEQ ID 
NO: 131, SEQ ID NO: 132, SEQ ID NO: 133, SEQ ID NO: 134, SEQ ID NO: 135, SEQ ID 
NO:136, SEQ IDNO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID 
NO: 1 41 , SEQ ID NO: 142, SEQ ID NO: 1 43, SEQ ID NO: 1 44, SEQ ID NO: 1 45, SEQ ID 

25 NO: 146, SEQ ID NO: 147, SEQ ID NO: 148, SEQ ID NO: 149, SEQ ID NO: 150, SEQ ID 
NO: 151, SEQ ID NO: 152, SEQ ID NO: 153, SEQ ID NO: 154, SEQ ED NO: 155, SEQ ID 
NO: 156, SEQ ID NO: 1 57, SEQ ID NO:l 58, SEQ ID NO: 1 59, SEQ ID NO: 1 60, SEQ ID 
NO:161, SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165. SEQ ID 
NO: 166, SEQ ID NO: 167, SEQ ID NO: 168, SEQ ID NO: 169, SEQ ID NO: 170, SEQ ID 

30 NO: 1 7 1 , SEQ ID NO: 1 72, SEQ ID NO: 1 73, SEQ ID NO: 1 74, SEQ ID NO: 1 75, SEQ ID 
NO: 1 76, SEQ ID NO: 1 77, SEQ ID NO:l 78, SEQ ID NO: 1 79, SEQ ED NO: 1 80, SEQ ID 
NO:181, SEQ ID NO: 182, SEQ ID NO: 183, SEQ ID NO:l 84, SEQ ID NO: 185, SEQ ID 
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NO: 186, SEQ ED NO: 187, SEQ ID NO: 188, SEQ ID NO: 189, SEQ ID NO: 190, SEQ ID 
NO:191, SEQ ID NO: 199, SEQ ID NO: 193, SEQ ID NO: 194, SEQ ID NO: 195, SEQ ID 
NO: 1 96, SEQ ID NO: 1 97, SEQ ID NO: 1 98, SEQ ID NO: 1 99, SEQ ID NO :200, SEQ ID 
NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID 
NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID 
NO:211, SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID 
NO:216, SEQ ID NO:217, SEQ ID N0.218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID 
NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ED NO:225, SEQ ID 
NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ED NO:229, SEQ ID NO:230, SEQ ID 
NO:231, SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID 
NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID 
NO:241, and SEQ ED NO:242 wherein said domain is selected from the group consisting 
of a C-terminal domain, a catalytic domain, an N-terminal domain, a spacer region, a 
proline-rich region, a coiled-coil structure region, and a C-terminal tail. 

] 3. The kinase polypeptide of claim 1 0, wherein said polypeptide is isolated, 
purified, or enriched from a mammal. 

14. The kinase polypeptide of claim 13, wherein said mammal is a human. 

15. The kinase polypeptide of claim 10, wherein said polypeptide is a 
AA144574, AA1 16841, AA256100, AA305176, AA210825, AA3 16804, AA980090, 
N42050, AA476563, AA626690, AA960957, HI 91 02, AA045601, AA107515, 

AA 109508 or AA887783 polypeptide. 

16. The kinase polypeptide of claim 10, wherein said polypeptide is a H60215, 
AA1 97883, AA297313, W30246, AA1 72300, AA383293, AA542015, H01 248, N23936, 
W44160, 2R22-5-1 1, 5R72-18-1, AA021445, AA207220, AA426580, AA544838, 
W90839, 5R79-54-1, AA839940, R19772 or 5R72-8-2 polypeptide. 

1 7. The kinase polypeptide of claim 1 0, wherein said polypeptide is a 

AA234451 polypeptide. 

18. The kinase polypeptide of claim 1 0, wherein said polypeptide is a 5R65- 1 6- 
1, AA061797, AA065538, AA124976, AA397553, AA435956, AA575635, AA626859, 
AA789239, AI086865, H 1 7727, H29974, AA557536 or N28606 polypeptide. 

1 9. The kinase polypeptide of claim 1 0, wherein said polypeptide is a 
AA63 1 990 or W08549 polypeptide. 
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20. The kinase polypeptide of claim 1 0, wherein said polypeptide is a 5R72-1 6- 
2, R19927 or R43524 polypeptide. 

21. The kinase polypeptide of claim 10, wherein said polypeptide is a 5R57-10- 
2 polypeptide. 

22. The kinase polypeptide of claim 10, wherein said polypeptide is a 
AA232253 polypeptide. 

23. The kinase polypeptide of claim 10, wherein said polypeptide is a 
AA430250, AA836348, R86668 or N34132 polypeptide. 

24. The kinase polypeptide of claim 10, wherein said polypeptide is a 
AA098024or SuRTKl 06 polypeptide. 

25. The kinase polypeptide of claim 10, wherein said polypeptide is a R47805, 
AA099102, AA589241, H8581 1, AA013524, AA452647, AA840598, AA088547, 
AA139478, AA826850, R87679, W65887, H97685, W20810, AA599286, AA425725, 
AA103218, AA71 1829, AA060026, AA399669, AA758539, AA883975, AA948538, 
AA018361, AA215311, AA311714, AA498104, 5R69-17-2, 5R69-23-3, 5R69-26-2, 
AA1 1 8352, AA396601 , AA671275, AA278842, AA4601 32 or H0572 1 polypeptide. 
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26. An antibody or antibody fragment having specific binding affinity to a 
kinase polypeptide selected from the group consisting of SEQ ID NO: 1 22, SEQ ID 
NO: 1 23, SEQ ID NO: 1 24, SEQ ID NO: 1 25, SEQ ID NO: 1 26, SEQ ID NO: 1 27, SEQ ID 
NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID NO:132, SEQ ID 
NO: 133, SEQ ID NO: 134, SEQ ID NO:135, SEQ ID NO: 136, SEQ ID NO: 137, SEQ ID 
NO: 138, SEQ ID NO: 139, SEQ ID NO: 140, SEQ ID NO: 141, SEQ ID NO: 142, SEQ ID 
NO: 143, SEQ ID NO: 144, SEQ ID NO: 145, SEQ ID NO: 146, SEQ ID NO: 147, SEQ ID 
NO: 1 48, SEQ ID NO: 1 49, SEQ ID NO: 1 50, SEQ ID NO: 1 5 1 , SEQ ID NO: 1 52, SEQ ID 
NO: 153, SEQ ID NO: 154, SEQ ID NO: 155, SEQ ID NO: 156, SEQ ID NO: 157, SEQ ID 
NO: 158, SEQ ID NO: 159, SEQ ID NO: 160, SEQ ID NO: 161, SEQ ID NO: 162, SEQ ID 
NO: 1 63, SEQ ID NO: 1 64, SEQ ID NO: 1 65 . SEQ ID NO: 1 66, SEQ ID NO: 1 67, SEQ ID 
NO: 168, SEQ ID NO: 169, SEQ ID NO: 170, SEQ ID NO: 171, SEQ ID NO: 172, SEQ ID 
NO: 1 73, SEQ ID NO: 1 74, SEQ ID NO: 1 75, SEQ ID NO: 1 76, SEQ ID NO: 1 77, SEQ ID 
NO: 1 78, SEQ ID NO: 1 79, SEQ ID NO: 1 80, SEQ ID NO: 1 8 1 , SEQ ID NO: 1 82, SEQ ID 
NO:l 83, SEQ ID NO: 1 84, SEQ ID NO: 1 85, SEQ ID NO: 1 86, SEQ ID NO: 1 87, SEQ ID 
NO: 1 88, SEQ ID NO: 1 89, SEQ ID NO: 1 90, SEQ ID NO: 1 91 , SEQ ID NO: 1 99, SEQ ID 
NO: 1 93, SEQ ID NO: 1 94, SEQ ID NO: 1 95, SEQ ID NO: 1 96, SEQ ID NO: 1 97, SEQ ID 
NO: 198, SEQ ID NO: 199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID 
NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID 
NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:21 1, SEQ ID NO:212, SEQ ID 
NO:213, SEQ IDNO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID 
NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID 
NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID 
NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID 
NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID 
NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241, and SEQ ID NO:242. 

27. The antibody or antibody fragment of claim 26, wherein said polypeptide 
comprises: 

(a) an amino acid sequence selected from the group consisting of SEQ 
ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ 
ID NO: 127, SEQ ID NO: 128, SEQ ID NO: 129, SEQ ID NO: 130, SEQ ID NO: 131, SEQ 
ID N0.132, SEQ ID NO: 133, SEQ IDNO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ 
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ID 


NO:137, 


SEQ 


IDNO:138, 


SEQ 


ID NO: 139, 


SEQ 


ID 


NO: 140, 


SEQ 


ID NO:141, 


SEQ 


ID 


NO:142, 


SEQ 


IDNO:143, 


SEQ 


ID NO: 144, 


SEQ 


ID 


NO:145, 


SEQ 


ID NO: 146, 


SEQ 


ID 


NO:147, 


SEQ 


IDNO:148, 


SEQ 


ID NO: 149, 


SEQ 


ID 


NO: 150, 


SEQ 


IDNO:151, 


SEQ 


ID 


NO:152, 


SEQ 


IDNO:153, 


SEQ 


IDNO:154, 


SEQ 


ID 


NO:155, 


SEQ 


ID NO: 156, 


SEQ 


ID 


NO: 157, 


SEQ 


IDNO:158, 


SEQ 


ID NO: 159, 


SEQ 


ID 


NO: 160, 


SEQ 


ID NO:161, 


SEQ 


ID 


NO: 162, 


SEQ 


IDNO:163, 


SEQ 


ID NO: 164, 


SEQ 


ID 


NO: 165. 


SEQ 


ID NO: 166, 


SEQ 


ID 


NO:167, 


SEQ 


ID NO: 168, 


SEQ 


ID NO: 169, 


SEQ 


ID 


NO: 170, 


SEQ 


IDNO:171, 


SEQ 


ID 


NO: 172, 


SEQ 


IDNO:173, 


SEQ 


ID NO: 174, 


SEQ 


ID 


NO: 175, 


SEQ 


ID NO: 176, 


SEQ 


ID 


NO:177, 


SEQ 


IDNO:178, 


SEQ 


ID NO: 179, 


SEQ 


ED 


NO: 180, 


SEQ 


IDNO:181, 


SEQ 


ID 


NO: 182, 


SEQ 


ID NO: 183, 


SEQ 


ID NO: 184, 


SEQ 


ID 


NO: 185, 


SEQ 


IDNO:186, 


SEQ 


ID 


NO: 187, 


SEQ 


IDNO:188, 


SEQ 


ID NO: 189, 


SEQ 


ID 


NO: 190, 


SEQ 


IDNO:191, 


SEQ 


ID 


NO: 199, 


SEQ 


ID NO: 193, 


SEQ 


ID NO: 194, 


SEQ 


ID 


NO: 195, 


SEQ 


ID NO: 196, 


SEQ 


ID 


NO: 197, 


SEQ 


IDNO:198, 


SEQ 


ID NO: 199, 


SEQ 


ID 


NO:200, 


SEQ 


ID NO:201, 


SEQ 


ID 


NO:202, 


SEQ 


IDNO:203, 


SEQ 


ID NO:204, 


SEQ 


ID 


NO:205, 


SEQ 


ID NO:206, 


SEQ 


ID 


NO:207, 


SEQ 


ID NO:208, 


SEQ 


ID NO:209, 


SEQ 


ID 


NO:210, 


SEQ 


IDNO:211, 


SEQ 


ID 


NO:212, 


SEQ 


IDNO:213, 


SEQ 


IDNO:214, 


SEQ 


ID 


NO:215, 


SEQ 


IDNO:216, 


SEQ 


ID 


NO:217, 


SEQ 


IDNO:218, 


SEQ 


IDNO:219, 


SEQ 


ID 


NO:220, 


SEQ 


IDNO:221, 


SEQ 


ID 


NO:222, 


SEQ 


IDNO:223, 


SEQ 


ID NO:224, 


SEQ 


ID 


NO:225, 


SEQ 


ID NO:226, 


SEQ 


ID 


NO:227, 


SEQ 


ID NO:228, 


SEQ 


IDNO:229, 


SEQ 


ID 


NO:230, 


SEQ 


IDNO:231, 


SEQ 


ID 


NO:232, 


SEQ 


IDNO:233, 


SEQ 


IDNO:234, 


SEQ 


ED 


NO:235, 


SEQ 


ID NO:236, 


SEQ 


ID 


NO:237, 


SEQ 


IDNO:238, 


SEQ 


EDNO:239, 


SEQ 


ID 


NO:240, 


SEQ 


IDNO:241, 


and 



SEQ ID NO:242; 

(b) an amino acid sequence selected from the group consisting of SEQ 
ID NO: 122, SEQ ED NO: 123, SEQ ID NO: 124, SEQ ID NO: 125, SEQ ID NO:126, SEQ 

25 ID NO: 127, SEQ ED NO: 128, SEQ ID NO: 129, SEQ ID NO: 130, SEQ ID NO:131, SEQ 
ID NO: 1 32, SEQ ED NO: 1 33, SEQ ID NO: 1 34, SEQ ID NO: 1 35, SEQ ID NO: 1 36, SEQ 
ID NO:137, SEQ ED NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141, SEQ 
ID NO: 142, SEQ ED NO:143, SEQ ID NO: 144, SEQ ID NO: 145, SEQ ID N0.146, SEQ 
IDNO:147, SEQEDNO:148, SEQIDNO:149, SEQIDNO:150, SEQIDN0:151, SEQ 

30 ID NO: 1 52, SEQ ID NO:l 53, SEQ ID NO: 1 54, SEQ ID NO: 1 55, SEQ ID NO:l 56, SEQ 
ID NO: 1 57, SEQ ID NO: 1 58, SEQ ID NO: 1 59, SEQ ID NO: 1 60, SEQ ID NO: 1 61 , SEQ 
ID NO: 162, SEQ ID NO:163, SEQ ED NO: 164, SEQ ID NO: 165. SEQ ID NO:166, SEQ 
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ID NO: 1 67, SEQ ID NO: 1 68, SEQ ID NO: 1 69, SEQ ID NO: 1 70, SEQ ID NO:l 7 1 , SEQ 
ID NO: 172, SEQ ID NO: 173, SEQ ID NO: 174, SEQ ID NO: 175, SEQ ID NO: 176, SEQ 
ED NO: 1 77, SEQ ID NO: 1 78, SEQ ID NO: 1 79, SEQ ID NO: 1 80, SEQ ID NO: 1 8 1 , SEQ 
ID NO: 1 82, SEQ ID NO: 1 83, SEQ ID NO: 1 84, SEQ ID NO: 1 85, SEQ ID NO: 1 86, SEQ 
5 ED NO: 1 87, SEQ ED NO: 1 88, SEQ ID NO: 1 89, SEQ ID NO: 1 90, SEQ ID NO: 1 9 1 , SEQ 
ID NO: 1 99, SEQ ID NO: 1 93, SEQ ID NO: 1 94, SEQ ID NO: 1 95, SEQ ID NO: 1 96, SEQ 
ID NO: 1 97, SEQ ED NO: 1 98, SEQ ID NO: 1 99, SEQ ED NO:200, SEQ ID NO.201 , SEQ 
ED NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ 
ED NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:21 0, SEQ ID NO:21 1 , SEQ 

10 ED NO:212, SEQ ED NO:213, SEQ ID NO:2I4, SEQ ID NO:215, SEQ ID NO:216, SEQ 
ID NO:2 1 7, SEQ ED NO:2 1 8, SEQ ED NO:2 1 9, SEQ ED NO:220, SEQ ID NO:22 1 , SEQ 
ED NO:222, SEQ ED NO:223, SEQ ED NO:224, SEQ ED NO:225, SEQ ED NO:226, SEQ 
ED NO:227, SEQ ED NO:228, SEQ ED NO:229, SEQ ID NO:230, SEQ ED NO:231, SEQ 
ED NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ED NO:235, SEQ ED NO:236, SEQ 

1 5 ED NO:237, SEQ ID NO:238, SEQ ED NO:239, SEQ ID NO:240, SEQ ED NO:241 , and 

SEQ ED NO:242, except that it lacks one or more, but not all, of the domains selected from 
the group consisting of a C-terminal domain, a catalytic domain, an N-terminal domain, a 
spacer region, a proline-rich region, a coiled-coil structure region, and a C-terminal tail. 

(c) a domain of an amino acid sequence selected from the group set 

20 forth in SEQ ED NO: 1 22, SEQ ID NO: 1 23, SEQ ID NO: 1 24, SEQ ID NO: 1 25, SEQ ID 
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NO: 1 86, SEQ ID NO: 1 87, SEQ ID NO: 1 88, SEQ ID NO: 1 89, SEQ ID NO: 1 90, SEQ ID 
NO: 1 9 1 , SEQ ID NO: 1 99, SEQ ID NO: 1 93 , SEQ ID NO: 1 94, SEQ ID NO: 195, SEQ ID 
NO: 1 96, SEQ ID NO: 1 97, SEQ ID NO: 1 98, SEQ ID NO: 1 99, SEQ ID NO:200, SEQ ED 
NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID 
NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID 
NO:21 1, SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID 
NO:216, SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID 
NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID 
NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID 
NO:231, SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID 
NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID 
NO:241, and SEQ ID NO:242 wherein said domain is selected from the group consisting 
of a C-terminal domain, a catalytic domain, an N-terminal domain, a spacer region, a 
proline-rich region, a coiled-coil structure region, and a C-terminal tail. 
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28. A hybridoma which produces an antibody having specific binding affinity 
to a kinase polypeptide selected from the group consisting of SEQ ID NO: 122, SEQ ID 
NO: 123, SEQ ID NO: 124, SEQ ID NO: 125, SEQ ID NO: 126, SEQ ID NO: 127, SEQ ID 
NO: 128, SEQ ID NO: 1 29, SEQ ID NO: 1 30, SEQ ID NO: 1 3 1 , SEQ ID NO: 1 32, SEQ ID 

5 NO:133, SEQ ID NO: 134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID NO:137, SEQ ID 
NO: 138, SEQ ID NO: 139, SEQ ID NO: 140, SEQ ID NO: 141, SEQ ID NO: 142, SEQ ID 
NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146, SEQ ID NO:147, SEQ ID 
NO: 148, SEQ ID NO: 149, SEQ ID NO: 150, SEQ ID NO:151, SEQ ID NO: 152, SEQ ID 
NO:l 53, SEQ ID NO:l 54, SEQ ID NO: 1 55, SEQ ID NO:l 56, SEQ ID NO: 1 57, SEQ ID 

10 NO: 1 58, SEQ ID NO: 1 59, SEQ ID NO: 1 60, SEQ ID NO: 1 61 , SEQ ID NO: 1 62, SEQ ID 
NO: 163, SEQ ID NO: 164, SEQ ID NO: 165. SEQ ID NO: 166, SEQ ID NO: 167, SEQ ID 
NO: 168, SEQ ID NO: 169, SEQ ID NO: 170, SEQ ID NO: 171, SEQ ID NO: 172, SEQ ID 
NO: 173, SEQ ID NO: 174, SEQ ID NO: 175, SEQ ID NO: 176, SEQ ID NO: 177, SEQ ID 
NO: 1 78, SEQ ID NO: 1 79, SEQ ID NO: 1 80, SEQ ID NO: 1 8 1 , SEQ ID NO: 1 82, SEQ ID 

15 NO: 1 83, SEQ ED NO: 1 84, SEQ ID NO: 1 85, SEQ ID NO: 1 86, SEQ ID NO: 1 87, SEQ ID 
NO: 1 88, SEQ ID NO: 1 89, SEQ ID NO: 1 90, SEQ ID NO: 1 9 1 , SEQ ID NO: 1 99, SEQ ID 
NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196, SEQ ID NO:197, SEQ ID 
NO:198, SEQ ID NO: 199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID 
NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID 

20 NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:21 1, SEQ ID NO:212, SEQ ID 

NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID 
NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID 
NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID 
NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:23 1 , SEQ ID NO:232, SEQ ID 

25 NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID 
NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241, and SEQ ED NO:242. 

29. A method for identifying a substance that modulates kinase activity 
comprising: 

(a) contacting a kinase polypeptide selected from the group consisting 
30 SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, 
SEQ ID NO:127, SEQ ED NO:128, SEQ ID NO:129, SEQ ID NO.130, SEQ ID NO:131, 
SEQ ED NO: 132, SEQ ED NO:133, SEQ ID NO:134, SEQ ID NO: 135, SEQ ID NO: 136, 
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and SEQ ID NO:242 with a test substance; 

(b) measuring the activity of said polypeptide; and 

(c) determining whether said substance modulates the activity of said 

25 polypeptide. 

30. A method for identifying a substance that modulates kinase activity in a 
cell comprising: 

(a) expressing a kinase polypeptide in a cell, wherein said polypeptide 
is selected from the group consisting of SEQ ID NO:122, SEQ ID NO:123, SEQ ID 
30 NO: 124, SEQ ID NO: 125, SEQ ID NO: 126, SEQ ID NO: 127, SEQ ID NO: 128, SEQ ID 
NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID NO:132, SEQ ID NO:133, SEQ ID 
NO:134, SEQ ID NO:135, SEQ ID NO:l 36, SEQ ID NO:137, SEQ ID NO:138, SEQ ID 
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NO:139, SEQ ID NO:140, SEQ ID NO:141, SEQ ID NO:142, SEQ ID NO:143, SEQ ID 
NO:144, SEQ ED NO:145, SEQ ID NO:146, SEQ ID NO:147, SEQ ID NO:148, SEQ ID 
NO: 1 49, SEQ ID N0:1 50, SEQ ID NO: 1 5 1 , SEQ ID NO: 1 52, SEQ ID NO: 1 53, SEQ ID 
NO: 1 54, SEQ ID NO:l 55, SEQ ID NO: 1 56, SEQ ID NO: 1 57, SEQ ID NO: 1 58, SEQ ID 
NO: 159, SEQ ID NO:160, SEQ ID NO: 161, SEQ ID NO: 162, SEQ ID NO: 163, SEQ ID 
NO: 164, SEQ ED NO: 165. SEQ ID NO: 166, SEQ ID NO: 167, SEQ ID NO: 168, SEQ ID 
NO: 169, SEQ ED NO: 170, SEQ ID NO:171, SEQ ID NO: 172, SEQ ID NO: 173, SEQ ID 
NO: 1 74, SEQ ED NO:l 75, SEQ ED NO: 1 76, SEQ ED NO: 1 77, SEQ ID NO: 1 78, SEQ ID 
NO: 1 79, SEQ ID NO: 1 80, SEQ ID NO: 1 8 1 , SEQ ID NO: 1 82, SEQ ID NO: 1 83 , SEQ ID 
NO: 184, SEQ ED NO:185, SEQ ED NO: 186, SEQ ED NO: 187, SEQ ID NO: 188, SEQ ID 
NO: 189, SEQ ID NO:190, SEQ ED NO: 191, SEQ ED NO: 199, SEQ ID NO: 193, SEQ ID 
NO: 1 94, SEQ ED NO: 1 95, SEQ ED NO: 1 96, SEQ ED NO: 1 97, SEQ ED NO: 1 98, SEQ ID 
NO:199, SEQ ED NO:200, SEQ ED NO:201, SEQ ED NO:202, SEQ ED NO:203, SEQ ED 
NO:204, SEQ ED NO:205, SEQ ED NO:206, SEQ ED NO:207, SEQ ID NO:208, SEQ ID 
NO:209, SEQ ED NO:210, SEQ ED NO:21 1, SEQ ED NO:212, SEQ ED NO:213, SEQ ED 
NO:214, SEQ ED NO:215, SEQ ED NO:216, SEQ ED NO:217, SEQ ED NO:218, SEQ ID 
NO:219, SEQ ED NO:220, SEQ ED NO:221, SEQ ED NO:222, SEQ ID NO:223, SEQ ID 
NO:224, SEQ ED NO:225, SEQ ID NO:226, SEQ ED NO:227, SEQ ID NO:228, SEQ ID 
NO:229, SEQ ED NO:230, SEQ ED NO:23 1 , SEQ ID NO:232, SEQ ED NO:233, SEQ ID 
NO:234, SEQ ED NO:235, SEQ ID NO:236, SEQ ED NO:237, SEQ ID NO:238, SEQ ID 
NO:239, SEQ ID NO:240, SEQ ED NO:241, and SEQ ED NO:242; 

(b) adding a lest substance to said cell; and 

(c) monitoring a change in cell phenotype or the interaction between 
said polypeptide and a natural binding partner. 
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31. A method for treating a disease or disorder by administering to a patient in 
need of such treatment a substance that modulates the activity of a kinase selected from 
the group consisting of SEQ ID NO: 122, SEQ ID NO: 123, SEQ ID NO: 124, SEQ ID 
NO: 125, SEQ ID NO:126, SEQ ID NO: 127, SEQ ID NO: 128, SEQ ID NO: 129, SEQ ID 

5 NO:130, SEQ ED NO:131, SEQ ID NO: 132, SEQ ID NO: 133, SEQ ID NO: 134, SEQ ID 
NO: 135, SEQ ID NO: 136, SEQ ID NO: 137, SEQ ID NO: 13 8, SEQ ID NO: 139, SEQ ID 
NO: 140, SEQ ID NO: 141, SEQ ID NO: 142, SEQ ID NO: 143, SEQ ID NO: 144, SEQ ID 
NO:145, SEQ ID NO:146, SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID 
NO: 150, SEQ ID NO: 151, SEQ ID NO: 152, SEQ ID NO: 153, SEQ ID NO: 154, SEQ ID 

10 NO:155, SEQ ID NO:156, SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID 
NO: 160, SEQ ID NO: 161, SEQ ID NO: 162, SEQ ID NO: 163, SEQ ID NO: 164, SEQ ID 
NO: 1 65. SEQ ID NO: 1 66, SEQ ID NO: 1 67, SEQ ID NO: 1 68, SEQ ID NO: 1 69, SEQ ID 
NO: 1 70, SEQ ID NO: 1 7 1 , SEQ ID NO: 1 72, SEQ ID NO: 1 73, SEQ ID NO: 1 74, SEQ ID 
NO: 175, SEQ ID NO: 176, SEQ ID NO: 177, SEQ ID NO: 178, SEQ ID NO: 179, SEQ ID 

15 NO: 180, SEQ ED NO: 1 81, SEQ ID NO: 182, SEQ ID NO:l 83, SEQ ID NO:l 84, SEQ ID 

NO:185, SEQ ID NO:186, SEQ ID NO:187, SEQ ID NO:l 88, SEQ ID NO:l 89, SEQ ID 
NO: 190, SEQ ID NO: 191, SEQ ID NO: 199, SEQ ID NO: 193, SEQ ID NO: 194, SEQ ID 
NO: 195, SEQ ID NO: 196, SEQ ID NO: 197, SEQ ID NO: 198, SEQ ID NO: 199, SEQ ID 
NO:200, SEQ ID NO:201, SEQ ID NO.202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID 

20 NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID 
NO:210, SEQ ID NO:21 1, SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID 
NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID 
NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID 
NO:225, SEQ ID NO:226, SEQ DO NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID 

25 NO:230, SEQ ED NO:23 1 , SEQ ED NO:232, SEQ ED NO:233, SEQ ED NO:234, SEQ ID 
NO:235, SEQ ED NO:236, SEQ ED NO:237, SEQ ED NO:238, SEQ ED NO:239, SEQ ID 
NO:240, SEQ EDNO:241, and SEQ EDNO:242. 

32. The method of claim 31 , wherein said disease or disorder is selected from 
the group consisting of immune-related diseases and disorders, cardiovascular disease, 

30 neurodegenerative disorders, and cancer. 

33. The method of claim 3 1 , wherein said substance modulates kinase activity 

in vitro. 
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34. The method of claim 33, wherein said substance is a kinase inhibitor. 

35. A method for detection of a kinase polypeptide in a sample as a diagnostic 
tool for a disease or disorder, wherein said method comprises: 

(a) contacting said sample with a nucleic acid probe which hybridizes 
5 under hybridization assay conditions to a nucleic acid target region of a kinase polypeptide 

selected from the group consisting of SEQ ID NO: 122, SEQ ID NO: 123, SEQ ID NO: 124, 
SEQ ID NO:125, SEQ ID NO:126, SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, 
SEQ ID NO: 1 30, SEQ ID NO: 1 3 1 , SEQ ID NO: 1 32, SEQ ID NO: 1 33 , SEQ ID NO: 1 34, 
SEQ ID NO: 135, SEQ ID NO: 136, SEQ ID NO: 137, SEQ ID NO: 138, SEQ ID NO. 139, 

10 SEQ ID NO: 140, SEQ ID NO: 141, SEQ ED NO: 142, SEQ ID NO: 143, SEQ ID NO: 144, 

SEQ ID NO:145, SEQ ID NO:146, SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, 
SEQ ID NO:l 50, SEQ ED NO:l 5 1 , SEQ ED NO:152, SEQ ED NO: 1 53, SEQ ID NO: 1 54, 
SEQ ED NO:155, SEQ ED NO:156, SEQ ED NO: 157, SEQ ED NO: 158, SEQ ED NO: 159, 
SEQ ED NO: 1 60, SEQ ED NO: 1 61 , SEQ ED NO: 1 62, SEQ ID NO: 1 63 , SEQ ED NO: 1 64, 

15 SEQ ED NO: 165. SEQ ED NO: 166, SEQ ED NO: 167, SEQ ED NO: 168, SEQ ED NO: 169, 

SEQ ED NO: 1 70, SEQ ED NO: 1 7 1 , SEQ ED NO: 1 72, SEQ ID NO: 1 73, SEQ ID NO: 1 74, 
SEQ ED NO: 1 75, SEQ ID NO: 1 76, SEQ ED NO: 1 77, SEQ ID NO: 1 78, SEQ ID NO: 1 79, 
SEQ ID NO:l 80, SEQ ID NO: 1 8 1 , SEQ ID NO: 1 82, SEQ ID NO:l 83, SEQ ID NO: 1 84, 
SEQ ED NO: 1 85, SEQ ED NO: 1 86, SEQ ID NO: 1 87, SEQ ID NO: 1 88, SEQ ID NO: 1 89, 

20 SEQ ED NO: 1 90, SEQ ID NO: 1 91 , SEQ ID NO: 1 99, SEQ ID NO: 1 93 , SEQ ID NO: 1 94, 
SEQ ID NO: 1 95, SEQ ID NO: 1 96, SEQ ID NO: 1 97, SEQ ID NO: 1 98, SEQ ID NO: 1 99, 
SEQ ED NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, 
SEQ ED NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, 
SEQ ED NO:2 1 0, SEQ ED NO:2 1 1 , SEQ ED NO:2 1 2, SEQ ID NO:2 1 3 , SEQ ID NO:2 1 4, 

25 SEQ ED NO:215, SEQ ED NO:216, SEQ ED NO:217, SEQ ED NO:218, SEQ ID NO:219, 
SEQ ED NO:220, SEQ ED NO:221, SEQ ED NO:222, SEQ ED NO:223, SEQ ED NO:224, 
SEQ ID NO:225, SEQ ED NO:226, SEQ ED NO:227, SEQ ID NO:228, SEQ ID NO:229, 
SEQ ED NO:230, SEQ ED NO:231, SEQ ED NO:232, SEQ ED NO:233, SEQ ED NO:234, 
SEQ ID NO:235, SEQ ID NO:236, SEQ ED NO:237, SEQ ED NO:238, SEQ ID NO:239, 

30 SEQ ED NO:240, SEQ ED NO:241, and SEQ ID NO:242, said probe comprising the 

nucleic acid sequence encoding said polypeptide, fragments thereof, or the complements 
of said sequences and fragments; and 
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(b) detecting the presence or amount of the probe: target region hybrid 
as an indication of said disease. 

36. The method of claim 35, wherein said disease or disorder is selected from 
the group consisting of immune-related diseases and disorders, cardiovascular disease, 

5 neurodegenerative disorders, and cancer. 

37. A method for detection of a kinase polypeptide in a sample as a diagnostic 
tool for a disease or disorder, wherein said method comprises: 

(a) comparing a nucleic acid target region encoding said kinase 
polypeptide in a sample, wherein said kinase polypeptide is selected from the group 
10 consisting of SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ 
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SEQ 


ID 


NO:200, 


SEQ 


E)NO:201, 


SEQ 


ID NO:202, 


SEQ 


ID 


NO:203, SEQ 


ID 


NO:204, 


SEQ 


ID 


NO:205, 


SEQ 


ID NO:206, 


SEQ 


ID NO:207, 


SEQ 


ID 


NO:208, SEQ 


ID 


NO:209, 


SEQ 


ID 


NO:210, 


SEQ 


IDNO:211, 


SEQ 


EDNO:212, 


SEQ 


ID 


NO:213,SEQ 


ID 


NO:214, 


SEQ 


ID 


NO:215, 


SEQ 


H)NO:216, 


SEQ 


E)NO:217, 


SEQ 


ID 


NO:218, SEQ 


ID 


NO:219, 


SEQ 


ID 


NO:220, 


SEQ 


E)NO:221, 


SEQ 


ID NO:222, 


SEQ 


ID 


NO:223, SEQ 


ID 


NO:224, 


SEQ 


ID 


NO:225, 


SEQ 


ID NO:226, 


SEQ 


ID NO:227, 


SEQ 


ID 


NO:228, SEQ 


ID 


NO:229, 


SEQ 


ID 


NO:230, 


SEQ 


IDNO:231, 


SEQ 


IDNO:232, 


SEQ 


ID 


NO:233, SEQ 


ID 


NO:234, 


SEQ 


ID 


NO:235, 


SEQ 
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ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ED NO:240, SEQ 
ID NO:241, and SEQ ID NO:242, or one or more fragments thereof, with a control nucleic 
acid target region encoding said kinase polypeptide, or one or more fragments thereof; and 
(b) detecting differences in sequence or amount between said target 
5 region and said control target region, as an indication of said disease or disorder. 

38. The method of claim 37, wherein said disease or disorder is selected from 
the group consisting of immune-related diseases and disorders, cardiovascular disease, 
neurodegenerative disorders, and cancer. 
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FIGURE 1A 

SEO ID NO: 122 X69117_H BARK2_H 

MMDLEAVIJU3VSYLMAMEKSKATPAARASKRIVLPEPSIRSVMQKYLAERNEITFDKIFN 

OKI GFLLFKDFCLNE INEAVPQVKFYEE I KE YEKLDNEEDRLCRSRQI YDAY I MKELLSC 
SHPFSKQAVEHVQSHLSKKQVTSTLFQPYIEEICESLRGDIFQKFMESDKFTRFCQWKNV 
ELNIHLTMNEFSVHRIIGRGGFGEVYGCRKADTGKMYAMKCLDKKRIKMKQGETLALNER 
IMLSLVSTGDCPFIVCMTYAFHTPDKLCFILDLMNGGDLHYHLSQHGVFSEKEMRFYATE 

I ILGLEHVHNRFWYRDLKPANILLDEHGHARI SDLGLACDFSKKKPHASVGTHGYMAPE 
VLOKGTAYDSSADWFSLGCMLFKLLRGHSPFRQHKTKDKHEIDRMTLTVNVELPDTFSPE 
LKSLLEGLLQRDVSKRLGCHGGGSQEVKEHSFFKGVDWQHVYLQKYPPPLIPPRGEVNAA 
DAFDIGSFDEEDTKGIKLLDCDQELYKNFPLVISERWQQEVTETVYEAVNADTDKIEARK 
RAKNKQLGHEEDYALGKDCIMHGYMLKLGNPFLTQWQRRYFYLFPNRLEWRGEGESRQNL 
LTMEQ I L S VEETQ I KDKKC I LFRI KGGKQFVLQCESDPE FVQWKKELNETFKEAQRLLRR 
APKFLNKPRSGTVELPKPSLCHRNSNGL 

9FO ID NO- 123 AA144574 M BARK2_M 

CFVVYRDLKPANILLDEYGHVRISDLGLACDFSKKKPHASVGTHGYMAPEVLQKGTCYDS 
SADWFSLGCMLFKLLRGHSPFRQHKTKDKHEIDRMTLTVNVQLPDAFSPELRSLLEGLLQ 
RDVSQRLGCGGGGARELKEHIFFKGIDWQHVYLRKYPPPLIPPRGEVNAADAFDIGSFDE 

EDTKGI KLLDCDQDLYKNFPLVI S ERWQQE WET I YDAVNADTDKI EARKKAKNKQLGQE 
EDYAMGKDCIMHGYMLKLGNPFLTQWQRRYFYLFPNRLEWRGEGESRQSLLTMEQIMSVE 
ETQIKDRKCILLRIKGGKQFVLQCESDPEFAQWLKELTCTFNEAQRLLRRAPKFLNKPRA 

AILEFSKPPLCHRNSSGL 



c E 0 ID NO- 124 AA826850 H 

MGSSMSAATARRPVFDDKEDVNFDHFQILRAIGKGSFGKVCIVQKRDTEKMYAMKYMNKQ 

oc^r^eWfreleilqeiehvflvnlv^sfqdeedmfmvvdlllggdlryhi^qnvq 

FSEDTVRLY I CEMALALDYLRGQHI I HRDVKPDNI LLDERGHAHLTDFNI AT 1 1 KDGERA 
TALAGTKP YMAPE I FX S F VNGGTG Y S FE VDWW S VGVMAYELLRGW R P YD I H S SNAVE S LV 
OLFSTVSVQYVPTWSKEMVALLRKLLTVNPEHRLSSLQDVQAAPALAGVLWDHLSEKRVE 
PGFVPNKGRLHCDPTFELEEMILESRPLHKKKKRLAKNKSRDNSRDSSQSENDYLQDCLD 
AIQQDFVIFNREKLKRSQDLPREPLPAPESRDAAEPVEDEAERSALPMCGPICPSAGSG 

e?FO ID NO- 125 AA960957 H 

MGGNHSHKP PVFDENEEVNFDHFQI LRAI GKG S FGKVC I VQKRDTKKMYAMKYMNKQKC I 

eSr^fIelqimqglehpflvnlwysfqdeedm 

GT VKL YICELALALE YLQR YH IIHRDIKPDNI LLDEHGHVH I TDFN I AT WKGAERAS SM 

aotS^pevfqvymdrgpgysypvdwwslgitayellrgwrpyeihsvtpideilnmf 

KVER^STWCKGMVALLRKLLTKDPESRVSSLHDIQSVPYLADI^DAVFKKALMPGF 

wnkSIncdptfeleemileskplhkkkkrlaknrsrdgtkdscplnghlqhcletvre 
efi I fnreklrrqqgqgsqlldtdsrgggqaqsklqdgcnnnllthtctrgcss 



mqstsLLllsdilgqgaWfrgrhkjctgdlfaikvfnnisflrpvdvqmre^ 

KLNHKNIVKIjFAIEEETTTRHKVLIMEFCPCGSLYTVLEEPSNAYGLPESEFLIVLRDVV 

SSfSivhrdikpgnimrvigedg^ 



WCYACRIASTLLLYOELMRKGIRWLIELIKDDYNETVHKKTEVVITLDFCIRNIEKTVK 
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FIGURE IB 

Ess"" 

GGLRNVDCL 

c EO ID NO: 127 AA305176_H nocT _ p c IVKPIS RGAFGKVYLGQKGGKLYA 
t t H IYGYFDEEMAVKYISEVAIALDYL^GI1»»^ GFNTP IAEKNQDPANILSACLS 

SSqshtfissvesechsspkwekdcqv 

-s— PEESEVBWE^PEV 

pQPDDETDTSYFEAMJNAQHLTVSGFSL 




;^„PEEEES == ^ 



KKDTLTEEETQFYI* 



<!TO ID NO: 130 RA210825_H CO „_ OTOR , LPS TPKDPMLRSPPW« SFI ' G 

dS L ™tplp}pwpvgs^e^^ 



™Sfcdhcgemleg^^ 



PLSEILTVESAQNFSLVPPGTNPHCFEIVTANA^ 

etSrqalmpvilqdapsapghaph^ 

QFGWYGGKHRKTGRDVAVKVIDKLRFPTKQESQ^ 




MYVSL 

SLSHPWLQEYQTWLDLREL 
CPPQDHDMQGLAERISVL 



^ ID 13 Mn^GLSDPYTKITNLSQKTKVIKKTLTPTV^ETFFVHFPEKTTLEL 
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FIGURE 1C 



MQEPSKRIPSWSGRPIV^iOWM^CTPmFRWSYTRPTI SDSSRGLDDT 
EEPSPPEDKMFFLDPSDLDVERDEEAVKTISPSTSNNIPL.™ V 

SNPHCFE 1 1 TDTMVYFVGENNGDSSHNPVLAATG FG j wggkHRKTGR 

CTSPGQGKDHKDLSTS I SVSNCQI QENVDI STOTQI F „__ pER VFWM EKLHGDML 
™IDKMRFPTKQESQ = ILWLHHPG ™^™ SAE p FP QVKLCD 
EMILSSEKSRI-PERITKFHVTOILVR^NLHFraiV^DL SGTFPPNE DE 

SSS^sSlHAYTHN^VVPKHFIMAPNPDD^EDP 

SEQ ID NO: 133_PKNBETA_H ^x-r^WLRRVATDRRHLGHVQQLLRSS 

mSgaprqpgpsqwppedekevir^ 

NRRLEQLHGELRELHARILLPGPG^^ 

^RARELEIGVHWRDWRQLCGVAFLRLEDFLDNACHO^SLbbV^ RT 
rRpfLQRQBRIPSKRR = ^^^ 

PTTLREASDPATPSNFLPKKTPLGEEMTPPPKPPR TGKYYAIKA LKKQEVLSRDE 

RGPSPPASPTRKPPRDQDFRC^VL^ 

IESLYCEKRILEAVGCTGHPFLLSLLVCl^ 

arfyvacwlglqflhekk = ^ 
gtpefiapevltoeaytqawwwalgvl^^ 

SEQ ID NO: 134_AI021023JJ P^TA M ^^ytraVDWHG 

lkvojllldaqgflkia^fglc^ 
hslltarqqaafrdfdfvserflep 

RLFAftELVLVICTUn)MI>HRD^I^R™L^ THSDSEIPAS 

S L ,HE=— ^ 
SAETMPFDDFDCDLESFLLYPIPA 
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FIGURE ID 

SEO ID NO: 136 AA476563_H 

MEFFRIDSKDSASELLGLDFGEKLYSLKSEPLKPFFTLPDGDSASRSFNTSESKVEFKAQ 
DTI SRGSDDSVPVI SFKDAAFDDVSGTDEGRPDLLVNLPGELESTREAAAMGPTKFTQTN 
IGIIENKLLEAPDVLCLRLSTEQCQAHEEKGIEELSDPSGPKSYSITEKHYAQEDPRMLF 
VAAVDHSSSGDMSLLPSSDPKFQGLGWESAVTANNTEESLFRICSPLSGANEYIASTDT 
LKTEEVLLFTDQTDDLAKEEPTSLFQRDSETKGESGLVLEGDKE IHQI FEDLDKKLALAS 

RF Y I PEGC I QRWAAEMWALDALHREG I VCRDLNPNN I LLNDRGH I QLTY F S RW S E VED S 
CDSDAIERMYCAPEVGAITEETEACDWWSLGAVLFELLTGKTLVECHPAGINTHTTLNMP 

ECVSEEARSLIQQLLQFNPLERLGAGVAGVEDIKSHPFFTPVDWAELMR 
SEO ID NO: 137 AA626690_H 

MLPFAPQDEPWDREMEVFSGGGAS SGEVNGLKMVDEPMEEGEADSCHDEGWKE I P I THH 
VKEGYEKADPAQFELLKVLGQGSFGKVFLVRKKTGPDAGQLYAMKVLKKASLKVRDRVRT 
KMERDILVEVNHPFIVKLHYAFQTEGKLYLILDFLRGGDVFTRLSKEVLFTEEDVKFYLA 

ELALALDHLHQLG I VYRDLKPENI LLDE I GH I KLTDFGLSKES VDQEKKAY S FCGT VEYM 
APEWNRRGHSQSADWWSYGVLMFEMLTGTLPFQGKDRNETMNMILKAKLGMPQFLSAEA 
OSLLRMLFKRNPANRLGSEGVEEIKRHLFFANIDWDKLYKREVQPPFKPASGKPDDTFCF 
DPEFTAKTPKDSPGLPASANAHQLFKGFSFVATSIAEEYKITPITSANVLPIVQINGNAA 

OFGEVYELKEDIGVGSYSVCKRCIHATTNMEFAVKI IDKSKRDPSEEIEILMRYGQHPNI 
ITLKDVFDDGRYVYLVTDLMKGGELLDRILKQKCFSEREASDILYVISKTVDYLHCQGW 
HRDLKPSNILYMDESASADSIRICDFGFAKQLRGENGLLLTPCYTANFVAPEVLMQQGYD 
AACDIWSLGVLFYTMLAGYTPFANGPNDTPEEILLRIGNGKFSLSGGNWDNISDGAKDLL 
SHMLHMDPHQRYTAEQILKHSWITHRDQLPNDQPKRNDVSHWKGAMVATYSALTHKTFQ 

PVLEPVAASSLAQRRSMKKRTSTGL 
SEO ID NO: 138 AA215680_H 

MSLVACECLPSPGLEPEPCSRARSQAHVYLEQIRNRVALGVPDMTKRDYLVDAATQIRLA 

LERDVS EDYEAAFNHYQNGVD VLLRG IHVDPNKERREAVKLKI TKYLRRAEE I FNCHLQR 
PLSSGASPSAGFSSLRLRPIRTLSSAVEQLRGCRWGVIEKVQLVQDPATGGTFWKSLP 

RCHMVSRERLTI I PHGVPYMTKLLRYFVSEDS I FLHLEHVQGGTLWSHLLSQAHSRHSGL 
SSGSTOERMKAQLNPHLNLLTPARLPSGHAPGQDRIALEPPRTSPNLLLAGEAPSTRPQR 
EAEGEPTARTSTSGSSDLPKAPGGHLHLQARRAGQNSDAGPPRGLTWVPEGAGPVLGGCG 
RGMDOSCLSADGAGRGCGRATWSVREEQVKQWAAEMLVALEALHEQGVLCRDLHPGNLLL 
DOAGHIRLTYFGQWSEVEPQCCGEAVDNLYSAPEVGGISELTEACDWWSFGSLLYELLTG 
MALSQSHPSGIQAHTQLQLPEWLSRPAASLLTELLQFEPTRRLGMGEGGVSKLKSHPFFS 

TIQWSKLVG 

SEO ID NO: 13 9 SGK H 

MTVKTEAAKGTLTYSRMRGMVAILIAFMKQRRMGLNDFIQKIANNSYACKHPEVQSILKI 
SOPOEPELMNANPSPPPSPSQQINLGPSSNPHAKPSDFHFLKVIGKGSFGKVLLARHKAE 
EVFYAVKVLQKKAILKKKEEKHIMSERNVLLKNVKHPFLVGLHFSFQTADKLYFVLDYIN 
GGELFYHLQRERCFLEPRARFYAAEIASALGYLHSLNIVYRDLKPENILLDSQGHIVLTD 
FGLCKENIEHNSTTSTFCGTPEYLAPEVLHKQPYDRTVDWWCLGAVLYEMLYGLPPFYSR 
NTAEMYDNILNKPLQLKPNITNSARHLLEGLLQKDRTKRLGAKDDFMEIKSHVFFSLINW 
DDLINKKITPPFNPNVSGPNELRHFDPEFTEEPVPNSIGKSPDSVLVTASVKEAAEAFLG 

FSYAPPTDSFL 

ciFO ID NO: 140 AA107515 M 

MTVKAEAARSTLTYSRMRG^AlIlAFMKQRRMGLNDFIQKIASNTYACKHAEVQSILKM 
SHPQEPELMNANPSPPPSPSQQINLGPSSNPHAKPSDFHFLKVIGKGSFGKVLLARHKAE 
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FIGURE IE 

SYAPPVDSFL 

SEQ ID NO: 141_AA109508_M mttvptyt KPENILLDCQGHWLTDFGLCKE 

ILDC 

SEQ ID NO: 142 ^877 83 H LVSVGRSEWFV FRRYAEFDKLYNT 

DFVNGGEGHWLTDFGLCKEGIAISD^ 
VNASVLEADDAFVGFSYAPPSEDLFL 

LFGTVKDDLSFAGYQKHI^SCAAPAPLTSAEREWQIRl SRV PRDAARYHFFL 



SEQ ID NO: 144_H6U^13 n c G ^AKRAGPFILGPRLGNSPVPSIVQCLAR 
MSKLRMKRRASDRGAGETSARAKALGSGISGNNAKRAOF GW HHHGLFQD 

DAQPMTSLDTAILAQRYLRK 

SEQ ID NO: 145_SGK324 H SSGpKGNGLI p S PAHSAHCSFYRTR 
MASTRSIELEHFEERDKRPRPGSRRGAPSSSGGSSSSGPK 

tlqalssekkak^rfyrngdry™ 

RTIYTIDGSRKVTSLDELLEGESYVCASNE^FR 

V«5> 
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FIGURE IF 

RDIKPENLLVCEYPDGTKSLKLGDFG^IWEGPLYT pYVJDNITDSAK ELISQMLQ 
GRRQVWPDCGAGLEVFELGSRELPSHGSWCLP 

TNLADLKNRGQYVAAGFERFHKLPPYQA^ 

klltekvklqsgavrlctlegl^ 

GNEAHLRSGVGTVAGSPKPLGRKAK^TCL^ 

phrrketagalevaddedtqteepldqraaq^ 

qddfvldhsrrrllrehqagfe^rr^ee^aekekk GNFAVVKEC rhretrqaya 

EPKTRPEENKPERPSGRKPRPMGIIAA^El^ 

MKI I DKSRLKGKEDMVDSE I LI ^^^g^^^LKP^NLLVQRNEDKSTTLjKLADFGLA 
IESVKFPEFDAAI*IMDI^ 

KRQKQVSPSSDGHFRSQHKRWEQVS 

SEQ ID NO: 14 8_AA197883 M HSLKCLSSKIS ERKLPGPWLPAGR 
RHQGKTSTVI^PEDKARAQKWVRGKQE^ 

AMKMIDKSQLKGKEDIVDSEILIIQSLSHPNIVKLH EDKS x TLKLAD FGL 

IVENVKFPE PEAAVM I ^^^^^y^^y ^g^^^AGVI LY I LLCGF P P FRS PERDQDE 

NSQKEESPNSLGHFQSQHKKVAEQMP 

SEQ ID NO: 149 - D ^ K J-" Tl , MPNFNNFYIL TSKELGRGKFAVVRQCISKSTGQEYA 
MSRRRFDCRSISGLLTTTPQIPI^ENFNNFYIL^ 

AKFLKKRRRGQDCRAEILHEIAVLEIAKSCPRVINbH ssiypLGDIK IVDF 
LCLPELAEMVSENDV I RLI KQ I ^ p^YyX^^p ^^.^q^wN i G 1 1 AYMLLTHTS PFVGEDN 
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FIGURE 1G 

LFHPEETSSSSQTQDHSVRSSEDKTSKSSCNGTCGDREDKENIPEDSSMVSKRFRFDDSL 
PNPHELVSDLLC 

SEQ ID NO: 150_W44160 M DRAK2^M GKFAWRQ CISKSTGQEYA 
MSR^RFDCRSVSGLLTTTPQTPI™ 

AKSLKKRRRGQDCRAEILHEIAVLE^SCP^INI^BV ypLGDIKIVDF 

LCLPELAEMVSENDV I RL I KQ I ^* ^Y^^pp^^^Q^yjjq j g^[ I AYMLLTHT S P FVGEDN 
GMSRKIGNASELREIMGTPEYLAPEILNYDPITTO 

PSPHELVPDLFC 

SEQ ID NO: 151_H01248 H, DRAK1 H ^ GLLXEIRAV VRTEPFQDGYS 
MIPLEKPGSGGSSPGATSGSGRAGRGLSGPCRPPPPPQ^ 

lcpgrelgrgkfawrkcikkdsg^™™^ 
vinlhevyetase^lvleyaagge^d^ 

dwhldlkpqnilltsesplgd^^ 
s^tdmwsigvltywltgispflOT 

.EKTQQQHMLYQQEQHHQILQQQIQDSICPPQ^S^ 

^^xr^TMHT.VROPSNSPPPMSSAMlQPHGAAbb^yrv prlcimMnHnHRTNL 




MATLSYGHRPLSKQLSADSAEAHSLNWRtbF spT pp DYTRH QQVPHILQ 

ihpyghqptaafsknkvpsrepvigncmdr^spgq^ 

VVREITWKRIGFTOIRGEI^FSQV^H^KV SEE , ESKLIFSQI 
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FIGURE 1H 

^^^^^^^^^ 

RVYRGIRHTSKFCSIL 




SKIANELKL 

SEQ ID NO: 155_W9083 9_M trttgKGNFAKVKLARHILTGREVAIK 
KGPSWSSRSLGARCRNSIASCPEEQPHVGNYRLLR^ 

IIDKTQI^PSSLQKLFREVRIMKG^ 

shgrmkekearakfrqivsavhychqkni™dlkaen^ SLPFDGH nlkelrerv 

SSdtfcgsppyaapelfqgkkydgpe^ 

lrgkyrvpfymstdcesilrrfl^^ 

RMPGRKASCSAVGSGSRGLPPSSPMVSSAHNPNKAE RSRLARGST IRSTFH 

? Y vcterpgserpsllpngkenssgt^ppaspssh^ 

f^G^^ 

RTALSEDRWSSYCLSSLAAQNICTSK^HCPA^PEHTDPS^ 
PLLPAPVCNPNKAIFTVDAKTT^ 

ALSEEHMEADGHAAWFGTVVDIITOSGEKIPVSV^ GQH IPKNLKIQRSV 

WAFQSDGTITSCDSLFAHLHGYVSG^ 
^DGTTFPLSLKLKSQPSSEEATTGEAAP^GYP^ 

DLPG^EAVEAQEVDVNSFSWNL^ 

VG S LQEQG S ^^^^^^ ^ pE^^Sy STPV I VMRGAAGLQREI QEGAY S G S C YH 

SSSSSSSS5 

8/ 10 
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FIGURE 11 

HPGDPELLTS 

SEO ID NO: 157_AAS4483S M 406786 M A TIKUDFGSMYL 

TRPHPCLDEPLkSFIFROLVSAVGYLHSQGIIHRDIKDEN pFCEVEETHEAV 

^pk^kgpS 

PKVNGCLLDPVPPVLWKGCQSLPMTSIMGLETE FLEDNPSLKDIML 

RRHTL SEVTNQL WMPG AGKI ^ qKREVTWRS PV S FREGRRASDT S LTQG I VAFRQ 

ANQPSPRMTSPFISLRPTNPAMQALSS^ 

HLQNIARTKGILEI^KVQLL^ 
LPASVHPQLSPRQSLETQYLQHRLQKPSLLS^Q^ 

RLFLQKQSQLQAYFNQMQIAESSYPQPS^ 
SELPGLFDCEMLDAVDPQHNGYVLVN 

SEQ ID NO: 159_AA2 07220 H KKQAVKRH HHKHNLRHRYEFLETLG 
MESLVFARRSGPTPSAAEIARP^LIKSPKPLMKK^ SSLNHPH IIAIHEVFE 

KGTY GKVKKARE S S GRLVA I g ™ ^^^p p^Qj v S AVH YCHQNR WHRDLKLEN 

nsskivivmeyasrgdlydyiserqqlserear™ 

ILLDANGNIKIADFGLSNLYHQGKFLQTFCGSPLYA 
YILVHGTMPFDGHDHKILVKQISNGAYREPP^K 

hwwvnvjgyatrvgeqeaphegghpgsdsaka 

GGGSTTPGLERQHSLKKSRKEN^^ 

lgdscfsltdcqevtatyrqalrvcsklt 

SEQ ID NO: 160_AA426580_H MAK V H vpRERLRDFQH HKRVGN 

Tpaaagdgllgepaapgggggae^ 

YLIGBRKLGEGSFAK^EG^VLTGE^AIKVIU SEARRYIRQL iSAVEHLHRA 

pnitqlldiletensyylvmelcpggnlmhkiyekkr^ GSPAYAA pellarkky 
g^rdlkienllldednni^idfg^ 

SpTp= 

V I'!) 
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FIGURE 1J 

DVINTVLSNRACHILAIYFLLNKKLERYLSGKSDIQDSLCYKTRLYQIEKYRAPKESYEA 
SLDTWTRDLEFHAVQDKKPKEQEKRGDFLHRPFSKKLDKNLPSHKQPSGSLMTQIQNTKA 
LLKDRKASKSSFPDKDSFGCRNIFRKTSDSNCVASSSMEFIPVPPPRTPRIVKKPEPHQP 
GPGSTGIPHKEDPLMLDMVRSFESVDRDDHVEVLSPSHHYRILNSPVSLARRNSSERTLS 
PGLPSGSMSPLHTPLHPTLVSFAHEDKNSPPKEEGLCCPPPVPSNGPMQPLGSPNCVKSR 
GRFPMMGIGQMLRKRHQSLQPSADRPLEASLPPLQPLAPVNLAFDMADGVKTQC 

SEQ ID NO: 161_Z36720_H 

MDTKLNMLNEKVDQLLHFQEDVTEKLQSMCRDMGHLERGLHRLEASRAPGPGGADGVPHI 
DTQAGWPEVLELVRAMQQDAAQHGARLEALFRMVAAVDRAIALVGATFQKSKVADFLMQG 
RVPWRRGSPGDSPEEWVKEEEVCFMPPVPPAPGAAGQSLQKDKGELSAEQGIWATLMTLV 
I MVTAANKERVEEEGGKPKHVLSTSGVQSDAREPGEESQKADVLEGTAERLPP I RASGLG 
ADPAQAWSPGQGDGVPGPAQAFPGHLPLPTKVEAKAPETPSENLRTGLELAPAPGRVNV 
VSPSLEVAPGAGQGASSSRPDPEPLEEGTRLTPGPGPQCPGPPGLPAQARATHSGGETPP 

RAALLKGAVAPGFSRRDLVFPS I FCACLGI S IHIQEMDTPGEMLMTGRGSLGPTLTTEAP 
AAAQPGKQGPPGTGRCLQAPGTEPGEQTPEGARELSPLQESSSPGGVKAEEEQRAGAEPG 
TRPSLARSDDNDHEVGALGLQQGKSPGAGNPEPEQDCAARAPVRAEAVRRMPPGAEAGSV 
VLDDSPAPPAPFEHRWSVKETSISAGYEVCQHEVLGGGRFGQVHRCTEKSTGLPLAAKI 

I KVKS AKDREDVKNE INI MNQL SHVNL I QLYDAFE S KH S CTLVME YVDGGELFDR I TDEK 
YHLTELDWLFTRQICEGVHYLHQHYILHLDLKPENILCVNQTGHQIKI IDFGLARRYKP 
REKLKVNFGTPEFLAPEWNYEFVSFPTDMWSVGVITYMLLSGLSPFLGETDAETMNFIV 
NCSWDFDADTFEGLSEEAKDFVSRLLVKEKSCRMSATQCLKHEWLNNLPAKASRSKTRLK 

SQLLLQKYIAQRKWKKHFYWTAANRLRKFPTSP 
SEQ ID NO: 162_SGK088_H 

GEMALFECLVAGPTDVEVDWLCRGRLLQPALLKCKMHFDGRKCKLLLTSVHEDDSGVYTC 
KLSTAKDELTCSARLTVRPSLiAPLFTRLLEDVEVLEGRAARFDCKI SGTPPPWTWTHFG 
CPMEESENLRLRQDGGLHSLHIAHVGSEDEGLYAVSAVNTHGQAHCSAQLYVEEPRTAAS 
GPSSKLEKMPSIPEEPEQGELERLSIPDFLRPLQDLEVGLAKEAMLECQVTGLPYPTISW 
FHNGHRIQSSDDRRMTQYRDVHRLVFPAVGPQHAGVYKSVIANKLGKAACYAHLYVTDW 
PGPPDGAPQWAVTGRMVTLTWNPPRSLDMAIDPDSLTYTVQHQVLGSDQWTALVTGLRE 

PGWAATGLRKGVQH I FRVL STTVKS S SKPS PPSEPVQLLEHGPTLEEAPAMLDKPD I VYV 
VEGQPASVTVTFNHVEAQWWRSCRGALLEARAGVYELSQPDDDQYCLRICRVSRRDMGA 
LTCTARNRHGTQTCSVTLELAEAPRFESIMEDVEVGAGETARFAVWEGKPLPDIMWYKD 
EVLLTESSHVSFVYEENECSLWLSTGAQDGGVYTCTAQNLAGEVSCKAELAVHSAQTAM 
EVEGVGEDEDHRGRRLSDFYDIHQEIGRGAFSYLRRIVERSSGLEFAAKFIPSQAKPKAS 

ARREARLLARLQHDCVLYFHEAFERRRGLV I VTELCTEELLERI ARKPTVCES E I RA YMR 
OVLEGIHYLHQSHVLHLDVKPENLLVWDGAAGEQQVRICDFGNAQELTPGEPQYCQYGTP 
EFVAPEI VNQSPVSGVTDI WPVGWAFLCLTGI SPFVGENDRTTLMNI RNYNVAFEETTF 
LSLSREARGFLIKVLVQDRLRPTAEETLEHPWFKTQAKGAEVSTDHLKLFLSRRRWQRSQ 
ISYKCHLVLRPIPELLRAPPERVWVTMPRRPPPSGGLSSSSDSEEEELEELPSVPRPLQP 
EFSGSRVSLTDIPTEDEALGTPETGAATPMDWQEQGRAPSQDQEAPSPEALPSPGQEPAA 
GASPRRGELRRGSSAESALPRAGPRELGRGLHKAASVELPQRRSPGPGATRLARGGLGEG 
EYAQRLQALRQRLLRGGPEDGKVSGLRGPLLESLGGRARDPRMARAASSEAAPHHQPPLE 
NRGLQKSSSFSQGEAEPRGRHRRAGAPLEIPVARLGARRLQESPSLSALSEAQPSSPARP 
SAPKPSTPKSAEPSATTPSDAPQPPAPQPAQDKAPEPRPEPVRASKPAPPPQALQTLALP 
LTPYAQIIQSLQLSGHAQGPSQGPAAPPSEPKPHAAVFARVASPPPGAPEKRVPSAGGPP 
VLAEKARVPTVPPRPGSSLSSSIENLESEAVFEAKFKRSRESPLSLGLRLLSRSRSEERG 
PFRGAEEEDGIYRPSPAGTPLELVRRPERSRSVQDLRAVGEPGLVRRLSLSLSQRLRRTP 
PAQRHPAWEARGGDGESSEGGSSARGSPVLAMRRRLSFTLERLSSRLQRSGSSEDSGGAS 

H if 3 ) 
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FIGURE IK 

GRSTPLFGRLRRATSEGESLRRLGLPHNQLAAQAGATTPSAESLGSEASATSGSSAPGES 

RSRLRWGFSRPRKDKGLSPPNLSASVQEELGHQYVRSESDFPPVFHIKLKDQVLLEGEAA 

TLLCLPAACPAPHISWMKDKKSLRSEPSVIIVSCKDGRQLLSIPRAGKRHAGLYECSATN 

VLGSITSSCTVAVARVPGKLAPPEVTQTYQDTALVLWKPGDSRAPCTYTLERRVDGESVW 

HPVSSGIPDCYYNVTHLPVGVTVRFRVACANRAGQGPFSNSSEKVFVRGTQDSSAVPSAA 

HQEAPVTSRPARARPPDSPTSLAPPLAPAAPTPPSVTVSPSSPPTPPSQALSSLKAVGPP 

PQTPPRRHRGLQAARPAEPTLPSTHVTPSEPKPFVLDTGTPIPASTPQGVKPVSSSTPVY 

WTSFVSAPPAPEPPAPEPPPEPTKVTVQSLSPAKEWSSPGSSPRSSPRPEGTTLRQGP 

PQKPYTFLEEKARGRFGWRACRENATGRTFVAKIVPYAAEGKPRVLQEYEVLRTLHHER 

IMSLHEAYITPRYLVLIAESCGNRELLCGLSDRFRYSEDDVATYMVQLLQGLDYLHGHHV 

LHLDIKPDNLLLAPDNALKIVDFGSAQPYNPQALRPLGHRTGTLEFMAPEMVKGEPIGSA 

TDIWGAGVLTYIMLSGRSPFYEPDPQETEARIVGGRFDAFQLYPNTSQSATLFLRKVLSV 

HPWSRPSLQDCLAHPWLQDAYLMKLRRQTLTFTTNRLKEFLGEQRRRRAEAATRHKVLLR 

SYPGGP 

SEQ ID NO: 163_AA542015_M SGK088_M 

ATDIWGAGVLTYIMLSGYSPFYEPDPQETEARIVGGRFDAFQLYPNTSQSATLFLRKVLS 
VHPWSRPSLQDCLAHPWLQDAYLMKLRRQTLTFTTNRLKEFLGEQRRRRAEAATRHKVLL 

RSYPGSP 

SEQ ID NO: 164_R19772_H 

MKGGDRAYTRGPSLGWLFAKCCCCFPCRDAYSHSSSENGGKSESVANLQAQPSLNFIHSS 
PGPKRSTNTLKKWLTSPVRRLNSGKADGNIKKQKKVRDGRKSFDLGSPKPGDETTPQGDS 
ADESKKGWGEDEPDEESHTPLPPPMKIFDNDPTQDEMSSSLLAARQASTEVPTAADLVNA 
IEKLVKNKLSLEGSSYRGSLKDPAGCLNEGMAPPTPPKNPEEEQKAKALRGRMFVLNELV 
OTEKDYVKDLGIWEGFMKRIEEKGVPEDMRGKDKIVFGNIHQIYDWHKDFFLAELEKCI 
QEQDRLAQLF I KHERKLH I YVWYCQNKPRSEY I VAEYDAYFEEVKQE INQRLTLSDFLI K 
PIORITKYQLLLKDFLRYSEKAGLECSDIEKAVELMCLVPKRCNDMMNLGRLQGFEGTLT 
AQGKLLQQDTFYVIELDAGMQSRTKERRVFLFEQI VI FSELLRKGSLTPGYMFKRS I KMN 
YLVLEENVDNDPCKFALMNRETSERWLQAANADIQQAWVQDINQVLETQRDFLNALQSP 
IEYQRKERSTAVMRSQPARLPQASPRPYSSVPAGSEKPPKGSSYNPPLPPLKISTSNGSP 
GFEYHQPGDKFEASKNDLGGCNGTSSMAVIKDYYALKENEICVSQGEWQVLAVNQQNMC 
LVYOPASDHSPAAEGWVPGSILAPLTKATAAESSDGSIKKSCSWHTLRMRKRAEVENTGK 
NEATGPRKPKDILGNKVSVKETNSSEESECDDLDPNTSMEILNPNFIQEVAPEFLVPLVD 

VTCLLGDTV I LQCKVCGRPKPTI TWKGPDQN I LDTDNS SATYTVS S CD SGE I TLKI CNLM 
PODSGIYTCIATNDHGTTSTSATVKVQGVPAAPNRPIAQERSCTSVILRWLPPSSTGNCT 
ISGYTVEYREEGSQIWQQSVASTLDTYLVIEDLSPGCPYQFRVSASNPWGISLPSEPSEF 
VRLPEYDAAADGATI SWKENFDSAYTELNE I GRGRFS I VKKC IHKATRKDVAVKFVNKKM 
KKKEQAAHEAALLQHLQHPQY I TLHDT YE S PTS Y I LI LELMDDGRLLDYLMNHDELMEEK 
VAFYIRDIMEALQYLHNCRVAHLDIKPENLLIDLRIPVPRVKLIDLEDAVQISGHFHIHH 
LLGNPEFAAPEVIQGIPVSLGTDIWSIGVLTYVMLSGVSPFLDESKEETCINVCRVDFSF 
PHEYFCGVSNAARDFINVILQEDFRRRPTAATCLQHPWLQPHNGSYSKIPLDTSRLACFI 

ERRKHQNDVRPI PNVKSYI VNRVNQGT 
SEO ID NO: 165 5R72_8_2_H 

MADSGLDKKSTKCPDCSSASQKDVLCVCSSKTRVPPVLWEMSQTSSIGSAESLISLERK 
KEKNINRDITSRKDLPSRTSNVERKASQQQWGRGNFTEGKVPHIRIENGAAIEEIYTFGR 
I LGKGS FG I V I EATDKETETKWAI KKVNKEKAGSS AVKLLEREVN I LKS VKHEH 1 1 HLEQ 
VFETPKKMYLVMELCEDGELKEILDRKGHFSENETRWI IQSLASAIAYLHNNDIVHRDLK 
LENIMVKSSLIDDNNEINLNIKVTDFGLAVKKQSRSEAMLQATCGTPIYMAPEVISAHDY 
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FIGURE 1L 

SQQCDIWSIGVVMYMLLRGEPPFLASSEAKLFELIRKGELHFENAVWNSISDCAK^LKQ 
LMKVDPAHRITAKELLDNQWLTGNKLSSVRPTNVLEMMKEWKNNPESVEENTTEEKNKPS 

teeiSk^qpwgiwpetnytsdeeeekqstayekqfpatskdnfdmcsssftsskllpae 

I KGEMEKTPVTPSQGTATKYPAKSGALSRTKKKL 

aS^qSk™Levavlkklqgsglgqgdgkee M mkpgakrgkdhvcrfigcgrne 

X™qlqgrniadlrrsqprgtftlsttlrlgkqilesieaihsvgfl^dik^ 

amgrlpstyrkcymldfglarqytnttgdvrpprnvagfrgtvryasvnahknremgrhd 

DLWSLFYMLVEFAVGQLPWRKI KDKEQVGMI KEKYEHRMLLKHMPSEFHLFLDHIASLDY 

ftkpdyqlimIvfS 

gqcdasawgpapLhrgcatgrapexpgectpnsarealxgagpq 
ggdrcqpeqtpdqhrqsncrqgegrgwpflsppipslvplpcssxapcpppisllarplf 

pvpspalaslclpsssssvsftlrrpsa 

msgggeqldilsvgT^kerwkvlrkiggggfgeiydaldm 
SeSkLqgkdhvcrfigcgrndrfnyvvmqlqgrni^dlrrsqsrgtf 
^oilesiesihsvgsxhrdikpsnfamgrfpstcrkcymldfglarqftnscgdvrpp 
^vagfrgtvryaLnahrnremgrhddlwslfymlvefwgqlpwrki^ 

gndStStLtpqlhtrltpaaigianatpipgdllrentdevfpde 

SspdSpgsLhprpqekdweemdanknkiklgickaateeenshgq^ 

gsSr^eSqpd^iplvrklrsihsfelekrltle 

TFTI ffemtvfdleak^arggsnllmdsvs SFQLF^ 

lSsh™ki^fgiaraksipsqtyssewtlwyrppdallgateysseldiwgag^ 

fiemfqgqp^vsnileqlekiwevlgvptedtwpgvsklp^ 
™r[gr?peaedlasqmlkgfprdrvsaqealvhdyfsalpsqlyqlpdeeslftvsgv 

rlkpemcdllasyqkghhpaqfskcw 

ay^atrwyrapellvgdtqygssvdiwaigcvfaelltgqplwpgksdvdqlylii^ 
^gk^prhqsifksngffhgisipepedmetleekfsdvhpvalnfmkgclkmnpddrl 

tcsqllessyfdsfqeaqikrkarnegi^ 

FDHLPNI 

KTALREI^LKLI^P^WLIEVF^ 

RME^^OTT^GSSVDVWAVGCVFAELLTGQPLWPGKSDVDQLYLIIRTLGKLIP^QS 

^Snqffrgisipepedmetleekfsi^qpvalsfmkgclkmnpderltcaqlldsayf 
esfqedqmkrkarsegrsrrrqqnqllplipgshisptpdgrkqvvqlkfdhlpni 

^p^„™ H PqvvdqsggASGTLQPS^ 
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FIGURE 1M 

O^KOTEKEKSOEVSSK— 

KEKTRKERELKSOTKDRS^HRraETPKSYWVDSPKR SSTRSPS p YS RRQR 

SsrhShskkkrsssrsrhssispvrlp^l^^ 

KGSPVFLPRKENSSVEAKDSGLESKKLPRSVKLEKSRPC1 TpppL p TI ASPPPPI.P 

SSdlplppelpggdlsppdspepkai^ 

T THRSVVmKEIVTDKQDALDFKKDKGAFYLVtbX^ EE SRPYTNKVITLW 

YRPPELLLGEERYTPAIDVWSCGCILGELFTKKPJ.^ DHMLTLDPS KRCTAEQTLQSDFL 

OTPYFNTMKPKKQYRRRLRE™ 

KDVELSKMAPFDLPHWQD^ 

MCQPAPPOPAPGKVESGAGDAIGLADITQQLNQSbLA pvIL p SAEQM TLEAS 

KOTSSKEKELRKDERKTOTTNTI.LSSSVLGKEIEKEKKP ENpHCGGSVT MPPI 

SEO ID HO: 173_AA124976_M ^ o „_ 1FKFIpELRAK LLQEAK\mSFIKPKEN 
^^CMIDPAERTESTD^YF^EI^E^^^^ 

FKENEPVRDEKKSVFTOTIAYGNPSLYGKEVDRDKKft pHAGGCM1M ppiNLT 

spSfhvdRPIeesllnpelirpfipeg 
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FIGURE IN 

SEQ ID NO: ^^^"lSSOJJ rhfSAEGPFAFAEMRHSKRTHCPDW 
MI TS I STEKSGHTHYPFMI ^^^^™puuQp^£SDCHYLEARSLNERDYRDRRY 

VDEYRNDYCEGYVPRHYHRDIESG^ 

DTLGEGAFGKWECIDHGMDGMHVAVKIV^GRYREAARSbig^ INFLHH NKL 
^EWFDHHGHVCIVFELLGLSTYD^ 

tStdlkpenilfvksdywkyn— ^™XehSmmerilgpipqhmiq 

RI TLDEALQHPFFDLLKKK 

SEQ ID NO: 176_AA557536 H _ G pp V QSPCPGTDLSRQERNWPSWA 

mc^vvdprivrryllrrqlgqgrtfreitll^ 

PEHSPSWPSSRLRLSPQEFGDHPN^^ 

lqdvhvrsifyqllra^ 

PEDQAVTEYVATRWRAPEVLLSSHRYTASCPKXi LLRRLLVF APDKRLSATQAL 

stlhqleliletipppseexrprqtldallppdts^ 

EGHHV 

MOK H 




RNGIFHKDVKPENILIKQDV^^---"— - GTPfi0KIIjT KFKQSRM«FDFP 

lddldfspslsridlknkkrqsddtl^ 
qprSpgliprppaaqpvhgrtdwaskypsrr 

SEQ ID NO: l 79 _ AA839940 - M _ DTM7DnPTvrTTDLO QGIDPGAVSPEPGKDHAAQ 

Knnggmsaeeeigpgaepmrgps^trdwrd^ 
gpgrteagrvssaaeaai™^^ 
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FIGURE 10 

QTGHQIKI IDFGLARRYKPREKLKVNFGTPEFLAPEVWYEFVSFPTDMWSVGVITYMLL 
SGLSPFLGETDAETMNFIVNCSWDFDADTFKGLSEEAKDFVSRLLVKEKSCRMSATQCLK 
HEWLNHLPAKAS G SNVRLRSQQLLQKYMAQ S KWKKHFHWAAVNRLRKF PTCP 

<?FO ID NO- 180 AA460132 H 

MAAARATTPADGEEPAPEAEALAAARERSSRFLSGLELVKQGAEARVFRGRFQGRAAVIK 

^SyLpalearlgrrrtvqearallrcrragisapwffvdyasnclymeeie^ 

TVRDYIQSTMETEKTPQGLSNLAKTIGQVLARMHDEDLIHGDLTTSNMLLKPPLEQLNIV 
™isFISALPEDKGVDLYVLEKAFLSTHPNTETVFEAFLKSYSTSSKKARPVLKKLD 

EVRLRGRKRSMVG 

c F n ID NO- 181 SGK034 H 

OREKVNQGNMPGLQiTFLAMDraEGVEVVVJNELHFGDRKAFAAHEEKIQTVFEQLVLVDH 

Ilsflhacsppiihgnltsdtifiqhnglikigsvwhrifsnalpddlrspiraereelr 
nlhffp^e^evadgtavdifsfgmcalemavleiqtngdtrvteeaiaparhslsdpnm 
refiIcSrdparrpsahsllfhrvlfevhslkllaahcfiqhqylmpei^eektkam 
dlhavlaelprprrpplqwrysevsfmeldkfledvrngiyplmnfaatrplglprvlap 
ppSv^ptpepfdsetrkviqmqcnlersedkarwhltlllvledrlhrqltydll 

ptdsaqdlaselvhygflheddrmklaaflestflkyrgtqa 

qpn ID NO- 182 AA103218 M SGK034_M 

Ssapeygevndgtgfvdifsfgmcalemavleiqangdtrvteeaiararhslsdpnmr 
Silsc^rdparrpsahnllfhrvlfe^ 

lSv^pqphgppmqwrysevsfleldkfledvrngiyplmnfaaarplglprvlapp 
pSIq^ktptpepfdsetrkwqmqcnlersedkarwhltlllvledrlhrqltydllp 

tdsaqdlaaelvhygflheddrtklaaflettflkyrgtqa 

qFO ID NO: 183 NEK7 H, N34132_H 

mIggaaekqsstpgslflsppapapkngsssdssvgeklgaaaadavtgrteeyrrrrht 
mdSSgaaatttttehrffrrsvicdsnatalelpglplslpqpsipaavpqsappeph 
reSwa^sqvaqqppaaaapgeqavagpapstvpsstskdrpvsqpslvgskeeppp 

SSgggsa^pqeersqmddieeletkavgmsndgrflkfd 
te???e™celqdrkltkserqrfkeeae M lkglqhpnivrfydswesw 

toswigdlg^ 

p?IecqnSq^RRVTSgvkpasfdkvaipevkeiiegcirqnkderysi^ 

EETGVRVELAEEDDGEKI AI KLWLRI EDI KKLKGKYKDNEAI EFCFDLERDVPEDVAQEM 

?esg™eothktmakaikdrvslikrkreqrqlvreeqenkkqeesslkqqveqssasq 
tc?SoTpSstgiptasttsasvstqvepeepeadqhqqlqyqqpsisvlsdgtvdsgqg 
ssvftesrvssqqtvsygfpxheqahstgtvpghipstvqaqsqphgvyppssvqqgiqq 

Spoatqpttiassvdsahsdvasgmsdgnenvpsssgrhegrttkrhyrksvrsrsrm 

^srpSrilnvsnkgdrwecqlethnrktwtfkfdldgdnpeeiati^dfi^ 

reSwweiiekademlsedvsvepegdqgleslqgkddygfsgsqklegefkqpipa 

sSp^ptssltqvvhsagrrfivspvpesrlreskvfpseitdtvaastaqspgmn 

lshsasslslqqafselrraqmtegpntappnfshtgptfpwppflssiagvpttaaat 

APVPA?SSPPNDISTSVIQSEVTVPTEEGIAGVATSTGWTSGGLPIPPVSESP 

qq^^PAWSISTTSPSLQVPTSTSEIWSSTALYPSVTVSATSASAGGSTATPGPKPPA 

WSQQ^T^GATLTSVSTTTSFPSTASQLSIQLSSSTSTPTDAETVVVSAHSLDKTS 
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FIGURE IP 

HSSTTGLAFSLSAPSSSSSPGAGVSSYISQPGGLHPLVIPSVIASTPILPQAAGPTSTPL 
LPQVPSITPLVQPVAITVPAVQQTLI^ 

TLEEKLRSLFSEHSSSGAQHASVSLETSLVIESTVTPGIPTTAVAPSKLLTSTTSTCLP^ 
^GRFSVSKTEDKITDTKKEGPVASPPFMDLEQAVLPAVIPKKEKPELSEPSHLNGPSSD 

p^aaflsS—sphsp^ 

LELRRLRDKHLKEIQDLQSRQKHEIESLYTKLGKVPPAVIIPPAAPLSGRRRR^ 
KSSRSSSLGNKSPQLSGNLSGQSAASVLHPQQTLHPPGNIPESGQNQLLQPLKPSPSSDN 

lysaftsdgai svpslsapgqgnkati ivqkq 

MSEGESQ^LSSGsBp^EsisSAPGLTSVSPPVTSTTSAASPEEEEESEDESEILEESP 

cgrwqSreewqrnvpgidsayi^dteegvev™ 
Sqilsalsy^ 

hffapeygevtnvttavdiysfgmcalemavleiqgngessyvpqeaissaiqlledplq 
refiqkclqIeparrptarellfhpalfevpslk^ 

dtsavlaeipagpgrepvqtlysqspaleldkfledvrngiypltafglprpqqpqqeev 
^spwp^Ivktptpepaevetrkvvlmqcniesveegvkhhltlllkledklnrhlscdl 
mpnenipelaaelvqlgfiseadqsrltslleetlnkfnfarnstlnsaavtvss 

cf?n TD NO- 185 AA711829 M 

lkoflkktkknhktwekawkrwctqilsalsylhscdppiihgnltcdtifiqhnglik 

gSsyv^emssaiqlledslqrefiqkclqseparrptarellfhpalfevpslklla 
Sc^ghqhmipenaleeitknmdtsavlaeipagpgrepvqtlysqspaleldkfledv 

^giypl?afglprpqqpqqeevtspwppsvk^ 

VKHHI 

FNFTRNSTLNTATVTVS s 



iLTLLL^EDKLNRHLSCDLMPNESIPDLAAELVQLGFISEADQSRLTSLLEETLNK 



msscvssq?ssnra^pqdelggr5ssssesqkpcealrglsslsihlgmesfiwtecep 

^i^lpyIpvsspqssprlprrptveshhvsitgmqdcvqlnqytlkdeigkgsygvvk 
Syneotntyyamkvlskkklirqaafprrppprgtrpapggciqprgpieqvyqeiail 
Sp^k^vlddpnedhlymvfelvnqgpvmevptlkplsedqarfyfqdlikgi 
eShto^iihrdikpsnllvgedghikiadfgvsnefkgsdallsnyvgtpafmapesls 
™?sgSSwamgvtlycfvfgqcpfmderimclhskiksqalefpdqpdiaedlk 

SSilvktmirkrsfgnpfegsrreerslsapgnlltkkptreceslselkearqrrqp 
pghrpaprggggsalvrgspcvescwapapgsparmhplrpeeamepe 

cpo TD NO: 187 5R69 17 2 H 

MOEIPQEQIKEIKKiQLSGSPwiLLRENEVSTLYKGEYHRAPVAIKVFKKLQAGSIAIVR 
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FIGURE 1Q 

L . 

SEQ ID NO: 188 _ H8581 i"" wrqvKKTjKIEPSS NWDMTGYGSHSKVYSQSKNIPLS 
MAPVYEGMASHVQVFSPHTLQSSAFCSVK^KIEPSS™ LGGPHNLM RRSTV 

EGDYQDVQHEVLC ^^"^^X^^? y^fOH^HTCLVFEMLEQNLY DFLKQNKF S PLPLKY 
QIEVSILARLSTESADDYNFVRAYECFQHIWHTCLVt DFGS ASHVSKAVCST 

?rpvlqqvatalmklkslglii^ 

YLQSRYYRAPEIII^LP^^^^^^^^p^^^g^Qjj^gj^£jj^KYIFNCLDDMAQVN 
AE^LLSAGTKTTRFFNRDTDSPYPL^LKTPDDHEA^ 

MTTDLEGSDMLVEKADRREFIDLLKKMLTIDADKRli^ FNNQLTTV HNQPSAASM 
VKSCFQNMEICKRRVNMYDTVNQSKTPFITH^ 

AAVAQRSMPLQTGTAQICA^P" 




PVNTSHHSSSYKSKSSSNVTSTSGHSSGSS^ 
TTDRTGSHRRQQAYITPT^QAPYSFP^SPSHGTW^ 



PAALG STGTVAHLVASQGS ARHTVQHTA 
AHQTY I SAS PASTVYTGYPLSPAKVNQYPYI 



SEQ ID NO: 189_DYRK3_H rn PTrnHTOHFLDGGEMKVEQLFQEFGNR 
^DETKCPPCSNVLCNPSEPPPPRR™ 

SIPLCSVLPKLIS 
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FIGURE 1R 

YLAMNLKEQDDSIWDILVEHI SGVSLAAHLSHSGPI PVHQLRRYTAQLLSGLDYLHSNS 

WHKVLSASNVLVDAEGTVKI TDYSI SKRLADI CKEDVFEQTRVRFSDNALPYKTGKKGD 

WJRLGLLLLSLSQGQECGEYPVTIPSDLPADFQDFLKKCVCLDDKERWSPQQLLKHSFIN 

PQPKMPLVEQSPEDSGGQDYVETVIPSNRLPSAAFFSETQRQFSRYFIEFEELQLLGKGA 

FGAVIKVQNKLDGCCYAVKRIPINPASRQFRRIKGEVTLLSRLHHENIVRYYNAWIERHE 

RPAGPGTPPPDSGPLAKDDRAARGQPASDTDGLDSVEAAAPPPILSSSVEWSTSGERSAS 

ARFPATGPGSSDDEDDDEDEHGGVFSQSFLPASDSESDI I FDNEDENSKSQNQDEDCNEK 

NGCHES EPS VTTEAVHYLY I QME YCEKSTLRDT I DQGL YRDTVRLWRLFRE I LDGLAY I H 

EKGMIHRDLKPVNIFLDSDDHVKIGDFGLATDHLAFSADSKQDDQTGDLIKSDPSGHLTG 

MVGTALYVSPEVQGSTKSAYNQKVDLFSLGIIFFEMSYHPMVTASERIFVLNQLRDPTSP 

KFPEDFDDGEHAKQKSVISWLLNHDPAKRPTATELLKSELLPPPQMEESELHEVLHHTLT 

NVDGKAYRTMMAQIFSQRI SPAIDYTYDSDILKGNFSIRTAKMQQHVCETI IRIFKRHGA 

VQLCTPLLLPRNRQIYEHNEAALFMDHSGMLVMLPFDLRIPFARYVARNNILNLKRYCIE 

RVFRPRKLDRFHPKELLECAFDIVTSTTNSFLPTAEIIYTIYEI IQEFPALQERNYSI YL 

NHTMLLKAI LLHCGI PEDKLSQVY I I LYDAVTEKLTRREVEAKFCNLSLSSNSLCRLYKF 

IEQKGDLQDLMPTINSLIKQKTGIAQLVKYGLKDLEEWGLLKKLGIKLQVLINLGLVYK 

VQQHNG 1 1 FQFVAF I KRRQRAV PE I LAAGGRYDLL I PQFRGPQALGPVPTA I GVS I A I DK 
ISAAVLNMEESVTISSCDLLWSVGQMSMSRAINLTQKLWTAGITAEIMYDWSQSQEELQ 
EYCRHHEITYVALVSDKEGSHVKVKSFEKERQTEKRVLETELVDHVLQKLRTKVTDERNG 
REASDNLAVQNLKGSFSNASGLFEIHGATWPIVSVLAPEKLSASTRRRYETQVQTRLQT 
SLANLHQKSSEIEILAVDLPKETILQFLSLEWDADEQAFNTTVKQLLSRLPKQRYLKLVC 

DEIYNIKVEKKVSVLFLYSYRDDYYRILF 
SEQ ID NO: 192 R4 3 524_H, HRI_H 

MLGGNSGVRKREEEGDGAGAVAAPPAIDFPAEGPDPEYDESDVPAEIQVLKEPLQQPTFP 
FAVANQLLLVSLLEHLSHVHEPNPLRSRQVFKLLCQTFIKMGLLSSFTCSDEFSSLRLHH 
NRAITHLMRSAKERVRQDPCEDISRIQKIRSREVALEAQTSRYLNEFEELVILGKGGYGR 
VYKVRNKLDGQYYAIKKILIKGATKTVCMKVLREVKVLAGLQHPNIVGYHTAWIEHVHVI 

QPRADRAAI ELPSLEVLSDQEEDREQCGVKNDE S S SS S 1 1 FAEPTPEKEKRFGESDTENQ 
NNKSVKYTTNLVIRESGELESTLELQENGLAGLSASSIVEQQLPLRRNSHLEESFTSTEE 
SSEENVNFLGQTEAQYHLMLHIQMQLCELSLWDWIVERNKRGREYVDESACPYVMANVAT 

KI FQELVEGVFY I HNMG I VHRDLKPRNI FLHGPDQQVKI GDFGLACTD I LQKNTDWTNRN 
GKRTPTHTSRVGTCLYASPEQLEGSEYDAKSDMYSLGWLLELFQPFGTEMERAEVLTGL 
RTGQLPESLRKRCPVQAKYIQHLTRRNSSQRPSAIQLLQSELFQNSGNVNLTLQMKIIEQ 

EKEIAELKKQLNLLSQDKGVRDDGKDGGVG 
SEO ID NO:" 193 17000057519457_H 

MAAARATTPADGEEPAPEAEALAAARERSSRFLSGLELVKQGAEARVFRGRFQGRAAVIK 
HRFPKGYRHPALEARLGRRRTVQEARALLRCRRAGISAPWFFVDYASNCLYMEEIEGSV 

TVRDY I QS TMETEKTPQGLSNLAKT I GQVLARMHDEDLI HGDLTTSNMLLKP PLEQLN I V 
LIDFGLSFISALPEDKGVDLYVLEKAFLSTHPNTETVFEAFLKSYSTSSKKARPVLKKLD 

EVRLRGRKRSMVG 

SEQ ID NO: 194 AA013524_M 

LVOOGAEARVFRGRFQGRAAWKHRFPKSYRHPELEARLGRRRTVQEARALLRCRRAGIA 

AP WFFVDYASNCLYMEE I EDS VTVRDY I QSTMETEKDPQCLLDLARRMGQVLAGMHDQD 
LIHGDLTTSNMLLRRPLAQLHIVLIDFGLSFVSGLPEDKGVDLYVLEKAFLSTHPHTETA 

FEAFLKS YGAS S KKS S PVLKKLDEVRLRGRKRSMVG 
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FIGURE IS 

SEQ ID NO: 195_17000139801197_H, IRAKM_H imuT ™ 
MAGNCGARGALSAHTLLFDLPPALLGELCAVLDSCDGALGWRGLAERLSSSWLDVRHIEK 

YVDQGKSGTRELLWSWAQKNKTI GDLLQVLQEMGHRRAIHLITNYGAVLS PSEKSYQEGG 
FPNILFKETANVTVDNVLIPEHNEKGVLLKSSISFQNIIEGTRNFHKDFLIGEGEIFEVY 
RVEIQNLTYAVKLFKQEKKMQCKKHWKRFLSELEVLLLFHHPNILELAAYFTETEKFCLI 

Y PYMRNGTLFDRLQCVGDTAPLPWH IRIGILIGI S KAI HYLHNVQPCS VI CGS IS SAN I L 
LDDQFQPKLTDFAMAHFRSHLEHQSCTINMTSSSSKHLWYMPEEYIRQGKLSIKTDVYSF 
GIVIMEVLTGCRWLDDPKHIQLRDLLRELMEKRGLDSCLSFLDKKVPPCPRNFSAKLFC 
LAGRCAATRAKLRPSMDEVLNTLESTQASLYFAEDPPTSLKSFRCPSPLFLENVPS I PVE 
DDESQNNNLLPSDEGLRIDRMTQKTPFECSQSEVMFLSLDKKPESKRNEEACNMPSSSCE 
ESWFPKYIVPSQDLRPYKVNIDPSSEAPGHSCRSRPVESSCSSKFSWDEYEQYKKE 

SEO ID NO- 196 AA840598_M IRAKM_M 

MW^FLSELEVLLLFRHPHILELAAYFTETEKLCLVYPYMSNGTLFDRLQCTNGTTPLSW 
HVRI SVL I GI AKAI QYLHNTQPCAVI CGNVSSANI LLDDQLQPKLTDFAAAHFRPNLEQQ 
S STI NMTGGGRKHLWYMPEEY I RQGRLS VKTDVYS FGI V I ME VLTGCKWLDDPKHVQLR 
DLLMELMEKRGLDSCLSFLDRKIPPCPRNFSAKLFSLAGRCVATKAKLRPTMDEVLSSLE 
STOPSLYFAEDPPTSLKSFRCPSPLFLDNVPSIPVEDDENQNNHSVPPKEVLGTDRVTQK 
TPFECSQSEVTFLGLDRNRGNRGSEADCNVPSSSHEECWSPELVAPSQDLSPTVISLGSS 

WEVPGHSYGSKPMEKRCSSGLFCSEHEQSKKQ 
SEO ID NO- 197 AA088547_H 

MASAVRGSRPWPRLGLQLQFAALLLGTLSPQVHTLRPENLLLVSTLDGSLHALSKQTGDL 
KWTLRDDPVIEGPMYVTEMAFLSDPADGSLYILGTQKQQGLMKLPFTIPELVHASPCRSS 
DGVFYTGRKQDAWFVVDPESGETQMTLTTEGPSTPRLYIGRTQYTVTMHDPRAPALRWNT 
TYRRYSAPPMDGSPGKYMSHLASCGMGLLLTVDPGSGTVLWTQDLGVPVMGVYTWHQDGL 
RQLPHLTIiARDTLHFLALRWGHIRLPASGPRDTATLFSTLDTQLLMTLYVGKDETGFYVS 
KALVHTGVALVPRGLTLAPADGPTTDEVTLQVSGEREGSPSTAVRYPSGSVALPSQWLLI 
GHHELPPVLHTTMLRVHPTLGSGTAETRPPENTQAPAFFLELLSLSREKLWDSELHPEEK 
TPDSYLGLGPQDLLAASLTAVLLGGWILFVMRQWEKQQETPLAPADFAHISQDAQSLHS 
GASRRSQKRLQSPSKQAQPLDDPEAEQLTWGKISFNPKDVLGRGAGGTFVFRGQFEGRA 

^lSScfglvrrevqllqesdrhpnvlryfctergpqfhyialelcraslqeyv^ 

PDLDRGGLEPEWLQQLMSGLAHLHSLHIVHRDLKPGNILITGPDSQGLGRWLSDFGLC 

kSpagrcsLlhsgipgtegwmapellqllppdsptsavdifsa 
gdslyrqaniltgapclahleeevhdkwardlvgamlsplpqprpsapqvlahpffwsr 
akqlqffqdvsdwlekeseqeplvraleaggcawrdnwhehismplqtdlrkfrsykgt 
swdllravrnkkhhyrelpvevrqalgqvpdgfvqyftnrfprlllhthramrscases 

lflpyyppdsearrpcpgatgr 

SEO ID NO- 198 HGP 6644 466 

MEGISNFKTPSKLSiKKKSVLCSTPTINIPASPFMQKLGFGTGVNVYLMKRSPRGLSHSP 

wavkkinpicndhyrsto^ 

kslndlieerySsqdpfpaai ilkvalnmarglkylhqekkllhgdikssnwikgdfe 
SScdvgvslpldenmtvtdpeacyig^ 

mtlsiphinlsnddddedktfdesdfddeayyaalgtrppinmeeldesyqkvielfsvc 
tnedpkdrpsaahivealetdv 



qFO ID NO: 199 AA44 9542 M 

sprglshspwavkkTsllcddhyrtvyqkrltdeakilknlnhpni igyrafteasdgsl 

C^E?GGEKSLNDLIEERNKDSGSPFPAAVILRVALH^RGLKYLHQE^ 
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KGDFETIKICDVGVSLPLDENMTVTDPEACYIGTEPWKPKEALEENGIITDKADVF 



FIGURE IT 

NWI KGDFET I KI CDVGVSLPLDENT 

AFGLTLWEMMTLCIPHVNLPDDDVDEDATFDESDFDDEAYYAALGTRPSINMELDDSYQK 
AIELFCVCTNEDPKDRPSAAHIVEALELDGQCCGLSESKH 

SEQ ID NO: 200_5R57_10_2_M TESK2_M 
LLDSDLYLPWTVRVKLAYGIAVGLSYLHFKGIFHRDLTSKV 

SEQ ID NO: 201 AA232253_H 

MSSLGASFVQIKFDDLQFFENCGGGSFGSVYRAKWISQDKEVAVKKLLKIEKEAEILSVL 
SHRNIIQFYGVILEPPNYGIVTEYASLGSLYDYINSNRSEEMDMDHIMTWATDVAKGMHY 
LHMEAPVKVIHRDLKSRNWIAADGVLKICDFGASRFHNHTTHMSLVGTFPWMAPEVIQS 
LPVSETCDTYSYGWLWEMLTREVPFKGLEGLQVAWLWEKNERLTIPSSCPRSFAELLH 
OCWEADAKKRPSFKQIISILESMSNDTSLPDKCNSFLHNKAEWRCEIEATLERLKKLERD 
LSFKEQELKERERRLKMWEQKLTEQSNTPLLPSFEIGAWTEDDVYCWVQQLVRKGDSSAE 
MSVYASLFKENNITGKRLLLLEEEDLKDMGIVSKGHIIHFKSAIEKLTHDYINLFHFPPL 
IKDSGGEPEENEEKIVNLELVFGFHLKPGTGPQDCKWKMYMEMDGDEIAITYIKDVTFNT 
NLPDAEILKMTKPPFVMEKWIVGIAKSQTVECTVTYESDVRTPKSTKHVHLIQWSRTKPQ 
DEVKAVQLAIQTLFTNSDGNPGSRSDSSADCQWLDTLRMRQIASNTSLQRSQSNPILGSP 
FFSHFDGQDSYAAAVRRPQVPIKYQQITPVNQSRSSSPTQYGLTKNFSSLHLNSRDSGFS 
SGNTDTSSERGRYSDRSRNKYGRGSISLNSSPRGRYSGKSQHSTPSRGRYPGKFYRVSQS 
ALNPHQSPDFKRSPRDLHQPNTIPGMPLHPETDSRASEEDSKVSEGGWTKVEYRKKPHRP 

S PAKTNKERARGDHRGWRNF 

SEQ ID NO: 202_AI375137_H 

MGNYKSRPTQTCTDEWKKKVSESYVI TI ERLEDDLQI KEKELTELRNI FGSDEAFSKVNL 
NYRTENGLSLLHLCCI CGGKKSHI RTLMLKGLRPSRLTRNGFTALHLAVYKDNAEL I TSL 
LHSGADIQQVGYGGLTALHIATIAGHLEAADVLLQHGANVNIQDAVFFTPLHIAAYYGHE 
qvTRLLLKFGADVNVSGEVGDRPLHLASAKGFLNIAKLLMEEGSKADVNAQDNEDHVPLH 
FCSRFGHHDIVKYLLQSDLEVQPHWNIYGDTPLHLACYNGKFEVAKEI IQISGTESLTK 
ENIFSETAFHSACTYGKSIDLVKFLLDQNVININHQGRDGHTGLHSACYHGHIRLVQFLL 
DNGADMNLVACDPSRSSGEKDEQTCLMWAYEKGHDAIVTLLKHYKRPQDELPCNEYSQPG 
GDGSYVSVPSPLGKIKSMTKEKADILLLRAGLPSHFHLQLSEIEFHEI IGSGSFGKVYKG 
RCRNKIVAIKRYRANTYCSKSDVDMFCREVSILCQLNHPCVIQFVGACLNDPSQFAIVTQ 

Y I S GG SLF S LLHEQKR I LDLQ SKL 1 1 AVDVAKGME YLHNLTQ P 1 1 HRDLN S HN ILLY EDG 
HAWADFGESRFLQSLDEDNMTKQPGNLRWMAPEVFTQCTRYTIKADVFSYALCLWEILT 

GEIPFAHLKPAAAAADMAYHHIRPPIGYSIPKPISSLLIRGWNACPEGRPEFSEWMKLE 

ECLCNIELMSPASSNSSGSLSPSSSSDCLVNRGGPGRSHVAALRSRFELEYALNARSYAA 

LSQSAGQYSSQGLSLEEMKRSLQYTPIDKYGYVSDPMSSMHFHSCRNSSSFEDSS 

SEO ID NO: 203 H97685 H 

mesersplyrqlidLgylssshwncgapgqdtkaqsmlveqseklrhlstfshqvlqtrl 

VDAAKALNLVHCHCLDIFINQAFDMQRDLQITPKRLEYTRKKENELYESLMNIANRKQEE 

mkdmivetlntmkeellddatnmefkdvivpengepvgtreikccirqiqeliisrlnqa 
vanklissvdylresfvgtlerclqsleksqdvsvhitsnylkqilnaayhvevtfhsgs 
svtrmlweqikqiiqritvjvsppaitlewkrkvaqeaieslsasklaksicsqfrtrlns 
sheafaaslrqleaghsgrlektedlwlrvrkdhaprlarlslesrslqdvllhrkpklg 
oelgrgqygvvylcdnwgghfpcalkswppdekhwndlalefhymrslpkherlvdlhg 
sv^ynyg^ssiavllimerlhrdlytglkagltletrlqialdvvegirflhsqglvh 
rdiklknvlldkqnrakitdlgfckpeammsgsivgtpihmapelftgkydnsvdvyafg 



1^ 
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FIGURE 1U 

ILFWYICSGSVKLPEAFERCASKDHLWNNVRRGARPERLPVFDEECWQLMEACWDGDPLK 
RP^GIVQPMLQGIMNRLCKSNSEQPNRGLDDST 

SEQ ID NO: 205_AA744236_H GKFASVEV YKRENEDKVNKAAKHLKTL 
RHPCLLRFLSCTVEADGIHLVTERVQPLE^^^^^^ RDPAS I PPEEMS PEFTT 

PEEPENQTVNIQIWPREPCDDVKSWTTLDVEESSWDU^ SSQERPLK VPSELG 

ISpSovSpS 

KF AAAE I TEGE AEGWEEEGELNWEDNNW 

SEQ ID NO: 206_AI052250_H . WKIFNGTKKSTK QE 

YPSMKNALIPRIKNACYKHLPLRFV 

SEO ID NO: 207 AA278842_H UDnT3VVA TrSPVSIFVYDVKPGAEEQTQV 
MWFfLdPWDFPFELIPEPPEGGLPGP^^ 

IKAAFKRFKTLREPNI IAVIDGLETEKCLHWTEAV 1 ^ KG j pE 

C^FL™^SLIH^C^™SEWK^^^^ pGKIPKTl ,vp m 



E^RRE^AKRAERKVAKGPMKLiGARKLD 
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FIGURE IV 



SEQ ID NO: 208_AA599286 H YIIRVQGGIS VENSVJQIVRRYSDFD 
MAFMEKPPAGKVLLDDTVPLTAAI ^^^^^^5 ^nv I TTNH I LSNCELVKKFLD 

PNNY vr^pe^^pY^YRV^AT^ES SALLI RMFNEKGTLKDLI YK 

ADLGPDKYLSDKDFQCLIKLLPSCLHPYIYRVTFA1AN GHLHASNV MLDGDT 

Sp^pflkkyc N pkkiqglelqqi™ 

CRLLDLENSLLGLPSFYRSYFSQFRKINTLESWVHC^ 
PAPS^VVAVLESTLSCEACKNGMPTIS™ 

iaLciekrlieeqkqihqhrrltraqshhgseeer^ 

LsNSNNSAGSGASSPL = 



SEQ ID NO: 209_AA425725 H GLLGSDDEEQ EDPKD 
MSASTGGGGDSGGSGGSSSSSQASCGPE^ 

YCKGGYHPVKIGDVFNGRYH^^GWGHFSTVWLC ^ WLEV ^ H QLLKWIIKSNY 

lSleepdmilrpsieevswhpwlast 



SEQ ID NO: 2H_AA060026 M GGpEEFIQRF LPREW 
, ™™ni.GKTTGEGTYSKVKEAFSKKHQKlw/iiivi ™ B »v»UROMVE 




lSSllepdmilrpsieevswhpwlast 

SEQ ID NO: 2 12_AA3 99669_H YGSV YEAFYTKQKVMVAVKIISKKK 

mgkgdvleaaptttayhslmdeygyevg^hgsyg^ 

asddylnkflpreiqqvmkvlrh^^ 
csepiagkwfsqltlgiaylhsk™ 

SEQ ID NO: 21 3 _ M758 "^r" KSMS ERLKFNVAVKI IDRRKTPTDFVERFLPRE 
MDDATVLRKKGYIVGINLGKGSWKVKSAYSERLKF^ 



VI/) 
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FIGURE 1W 




SEQ ID NO : 216_H29974 H ApENVELAIlAE FWALTSLKRRHQNV 

DLNQYVLSRRPDPATNKSFMLQLTSAIAFLHKNH1VM EGHYTAKAD I FALG 

^"Lanpqdrpdafeletrmdqvtcaa 

SEQ ID NO: 217_AA498104 M H29974 JJ EKLRPAA QAMDPAGAEVPGEA 

FIJ^RRPDGGGGDVPARPRYSL^EIGRGSYGVV^ LYLRL VETSLKGERIL 

GYAEEPCYLWFWEYCEGGDI^QY^SRRPDPATNKb ^^^lssacGSDFYM 

pdnilitersgtpilkvadfg^ 



'Cdggdmney^srkpnrktnt™ 




^isQPKYDLIREVGRGSYG^VXK^^^ 

KS( 
FC 

SEQ ID NO: S 13 -" 01 ""?;" otmhseeaPARAQRAGMAGPGWOPPRLDGFILT 

s Tp 0 ™~rpS™ 



RI 
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FIGURE IX 

LHYEVDAORKEAIKAKVGQYVSRAEELKAIVSSSNQALLRQGTSARDLLREMARDKPRLL 
S^SA^KEEAAGGEQDALDLYQHSLGELLLLLRSPRAGGGSCFTLRFRTSWPELN 



cpo TD NO- 220 AA311714 H m 

mInfiLLigrgsktvvykgrrkgtinfvailctdkcrrpeitnwrltreikhi^iw 

FHEWYETSNHLWLVXENLPEDWREFGIDLISGLHHLHKLGILFCDISPRKILLEGPGTL 

kfInfci^kvegenleeffalvaaeegggdn^ 

FSISSDLWSLGCLLYEMFSGKPPFFSESVSELTEKILCEDPLPPIPKDSSRPKASSDFIN 

Sdgllqrdpqkrltwtrllqhsfwkkafagadqessvedlslsrntmecsgpqds 
qnsqsrqakgSSgqplghsfrlenptefrpkst^ 

SPGEDM?£™ 

VKFDAKILHLPTYSVDKLLFLKDQDWNDFLQQVCSQIDSTEKSMGASRAKLNLLCYLCVV 

IghqeSrllLplfqlliqhlriapnwdirakvahvigl 

S IGI GI LNCLVQHSTPVPRQCLVYV 
SEQ ID NO: 221_SGK384_H 

SLAHVLRARQ I LTEPEVRDYLRGLVSGLRYLHQRCI LHR 
SFO ID NO- 222 AA210451 M SGK384_M 

MGQQHGTRNGLTHRELPRGVGLLLAMALMNVALYLCLDQLFISPGRSTAD 

MGRMRNCSRWLSCEELRTEVRQLKRVGEGAVKRVFLSEWKEHKVALSRLTRL^ 

rLOMLKSLOSEHWTLVGYCEEDGTILTEYHPLGSLSNLEETLNLSKYQDVNTWQHRLQL 

Sey^inSp^ 

^cotrel^dfvapeqlwpygedtpfqddlmpsynekvdiwkipdvssfllghvegsdm 
vrfhlfdihkacksqipaerptaqnvldayqrvfhslrdtvmsqtkeml 

E^AVQ^™ 

NFLSFOEVIEDKRKAKKI idsewmqnvlgqvldaleylhhldi ihrnlkpsni ili ssdh 
sf^gSlrkslrqspgslkavlktmeekqipdvetfrnllplmlqidpsdritikd 

VVM TFLRG S FKS S CV SLTLHRQMVP AS I TDMLLEGNVAS I LGDAGDTKGERALKLL SMA 
E^VEVWTTMELHDRVLDVQLCACSLLLHLLGQALVHHPEAKAPCNQAITSTLLSALQSH 

ppeepTI™sl^itttqeseslseelqnagllehilehlnsslesrdvcasglgll^ 

se^fkvwqeeggsglsliketyqlhrddpevvenvgmllvhlasyeeilpelvsssm 
kallqeikerftsslvsdssafskpglppggspqlgcttsggle 



cn>n Tn WO- 224 AA118352 M SGK071 M 

Ied^cqkSmap^sfs 

KPILKTHEEKQIPGTDWYLLLPFMLHIN^^ 



ILKTMEEKQIPGTDWYLLLPFMLHINPSDRLAIKDVMQVTFMSNSFKSSSVALNMQR 



OKVPI FITDVLLEGNMANI LGSWLCASFVNDSKWLiJ^i^^r ^ - 

Sfssrpevqlrainklltmpedqlglpwptelleevisiikqhgrildillstcsll 
^SlgoaSSpeaei prssli i sflmdtlrshpnserlvnwynvlai issqgqi seel 

EEEGL FQLAQENLEH FQEDRD I CLS I LS LLWS LL VD WTVDKE P SGMVTWVLATH ^ 

ml I/) 
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FIGURE 1Y 

VSELVAFRIWLEEGSSGLHLIQDIYKLYKDDPEWENLCMLLAHLTSYKEILPEMESGG 
IKDLVQVIRGRFTSSLELISYADEILQVLEANAQPGLQEDQLEPPAGQEAPLQGEPLFRP 

SEQ ID NO: 225_018653 . 9_H 

GRGRGAGHARGLGRGPAGRRAEPPRSLSRPGPGPGSRAGPAGRGEGSDAAPAGGSGRGFL 
RLLPAGLRPQRALRSGSEPPRPGQSPEPSPAPGAGRRGGRGELARQIRARYEEVQRYSRG 
GPGPGAGRPERRRLMDLAPGGPGLPRPRPPWARPLSDGAPGWPPAPGPGSPGPGPRLGCA 
ALRNVSGAQYMGSGYTKAVYRVRLPGGAAVALKAVDFSGHDLGSCVREFGVRRGCYRLAA 
HKLLKEMVLLERLRHPNVLQLYGYCYQDSEDIPDTLTTITELGAPVEMIQLLQTSWEDRF 
RICLSLGRLLHHLAHSPLGSVTLLDFRPRQFVLVDGELKVTDLDDARVEETPCAGSTDCI 
LEFPARNFTLPCSAQGWCEGMNEKRNLYNAYRFFFTYLLPHSAPPSLRPLLDSIVNATGE 
LAWGVDETLAQLEKVLHLYRSGQYLQNSTASSSTEYQCIPDSTIPQEDYRCWPSYHHGSC 
LLSVFNLAEAVDVCESHAQCRAFWTNQTTWTGRQLVFFKTGWSQWPDPNKTTYVKASG 

SEQ ID NO: 226_AA396601_M 

TRPGCAALRNVSGAQYVGSGYTKAVYRVRLPGGAAVALKAVDFSGHDLGSCVREFGARRG 
CYRLAAHKLLKEMVLLERLRHPNVLQLYGYCYQDSEGIPDTLTTITELGAPVEMIQLLQT 
SWEDRFRICLSLGRLLHHLAHSPLGSVTLLDFRPRQFVLVNGELKVTDLDDARVEETPCT 
SSADCTLEFPARNFSLPCSAQGWCEGMNEKRNLYNAYRFFFTYLLPHSAPPSLRPLLDSI 
VNATGELAWGVDETLAQLETALHLFRSGQYLQNSTSSRAEYQRI PDSAI TQEDYRCWPSY 
HHGGCLLSVFNLAEAIDVCESHAQCRAFWTNQTTWTGRKLVFFKTGWNQWPDAGKTTY 

VKAPG 

SEQ ID NO: 227_VRK3_H 

MISFCPDCGKSIQAAFKFCPYCGNSLPVEEHVGSQTFVNPHVSSFQGSKRGLNSSFETSP 
KKVKWSSTVTSPRLSLFSDGDSSESEDTLSSSERSKGSGSRPPTPKSSPQKTRKSPQVTR 

GSPQKTS CS PQKTRQS PQTLKRSRVTTSLEALPTGTVLTDKSGRQWKLKS FQTRDNQG I L 
YEAAPTSTLTCDSGPQKQKFSLKLDAKDGRLFNEQNFFQRAAKPLQVNKWKKLYSTPLLA 
IPTCMGFGVHQDKYRFLVLPSLGRSLQSALDVSPKHVLSERSVLQVACRLLDALEFLHEN 
EYVHGNVTAENIFVDPEDQSQVTLAGYGFAFRYCPSGKHVAYVEGSRSPHEGDLEFISMD 
LHKGCGPSRRSDLQSLGYCMLKWLYGFLPWTNCLPNTEDIMKQKQKFVDKPGPFVGPCGH 
WIRPSETLQKYLKWMALTYEEKPPYAMLRNNLEALLQDLRVSPYDPIGLPMVP 



SEQ ID NO: 228_S71575_M VRK3_M 

I PTC I GFG I HQDKYRFLVFP SLGRS LQSALDDN PKH W S ERCVLQVACRLLDALEYLHEN 
EYVHGNLTAENVFVNPEDLSQVTLVGYGFTYRYCPGGKHVAYKEGSRSPHDGDLEFISMD 
LHKGCGPSRRSDLQTLGYCMLKWLYGSLPWTNCLPNTEKITRQKQKYLDSPERLVGLCGR 
WNKASETLREYLKWMALNYEEKPPYATLRNSLEALLQDMRVSPYDPLDLQMVP 

SEQ ID NO: 229_AA45427_H ™^ 
MGHALCVCSRGTV 1 1 DNKRYLF I QKLGEGGFS YVDLVEGLHDGHFYALKR I LCHEQQDRE 
EAQREADMHRLFNHPNILRLVAYCLRERGAKHEAWLLLPFFKRGTLWNEIERLKDKGNFL 

TEDQ I LWLLLG I CRGLEAI HAKGYAHRDLKPTN I LLGDEGQPVLMDLGSMNQAC I HVEGS 
RQALTLQDWAAQRCTISYRAPELFSVQSHCVIDERTDVWSLGCVLYAMMFGEGPYDMVFQ 
KGDSVALAVQNQLS I PQS PRHS SALRQLLNSMMTVDPHQRPHI PLLLSQLEALQP PAPGQ 

HTTQI 

SEO ID NO: 230 H05721_H 

MAVRQALGRGLQLGRALLLRFTGKPGRAY GLGR PG P AAGC VRGER PG WAAG PGAE PRRVG 
LGLPNRLRFFRQSVAGLAARLQRQFWRAWGCAGPCGRAVFLAFGLGLGLIEEKQAESRR 
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AVSACQEIQAIFTQKSKPGPDPLDTRRLQGFRLEEYLIGQSIGKGCSAAVYEATMPTLPQ 
NLEVTKSTGLLPGRGPGTSAPGEGQERAPGAPAFPLAIKMMWNISAGSSSEAILNTMSQE 
LVPASRVALAGEYGAVTYRKSKRGPKQLAPHPNIIRVLRAFTSSVPLLPGALVDYPDVLP 
SRLHPEGLGHGRTLFLVMKNYPCTLRQYLCVNTPSPRLAAMMLLQLLEGVDHLVQQGIAH 
RDLKSDNILVELDPDGCPWLVIADFGCCLADESIGLQLPFSSWYVDRGGNGCLMAPEVST 
ARPGPRAVIDYSKADAWAVGAIAYEIFGLVNPFYGQGKAHLESRSYQEAQLPALPESVPP 
DVRQLVRALLQREASKRPSARVAANVLHLSLWGEHILALKNLKLDKMVGWLLQQSAATLL 
ANRLTEKCCVETKMKMLFLANLECETLCQAALLLCSWRAAL 

SEQ ID NO: 231_AI086865_H 

MEKYERIRWGRGAFGIVHLCLRKADQKLVI IKQIPVEQMTKEERQAAQNECQVLKLLNH 
PNVIEYYENFLEDKALMIAMEYAPGGTLAEFIQKRCNSLLEEETILHFFVQILLALHHVH 
THLI LHRDLKTQNI LLDKHRMWKI GDFG I SKI LS SKSTPCYI SPELCEGKPYNQKSDI W 

algcvlyelaslkrafeaanlpalvlkimsgtfapisdryspelrqlvlsllslepaqrp 
plshimaqplcirallnlhtdgrevrgpqqhreqdhqcplqrgiimtfgsgsngclghgs 
ltdisqptiveallgyemvqqveealsftllgsapldqepllsidlgtahsaavtgeedl 
gsgdvnrlpswerghllagvasstdvstfsegdckepdkccwrhkqctghi iypfasdcv 
rhslhlhsvnhcncnsrlkdssedssssrgagptcshviespcfeltpeeehverfrygw 
cksyrpvsvavihhplyhecgaddlnxkkrkrrrrkskppi ptqvgpataspdlgtsmat 
gtpdstapitiwrsesptgkgqgskvikkvkkkkekekdkeemdekaklkkkakkgqltk 
kkspvklepsppdvsrslsarqlarmsesspesreelesedsyngrgqgelssedi vess 
sprkrentvqakktgakpsqarkvnkrksppgsnpnls 

seq id no: 232_aa836348_h 

msvlgeyerhcdsinsdfgsesggcgdsspgpsasqgpragggaaeqeelhyipirvl.gr 
gafgeatlyrrteddslwwkevdltrlsekerrdalneivilallqhdni iayynhfmd 

NTTLLIELEYCNGGNLYDKILRQKDKLFEEEMWWYLFQIVSAVSCIHKAGILHRDIKTL 
NIFLTKANLIKLGDYGLAKKLNSEYSMAETLVGTPYYMSPELCQGVKYNFKSDIWAVGCV 
IFELLTLKRTFDATNPLNLCVKIVQGIRAMEVDSSQYSLELIQMVHSCLDQDPEQRPTAD 
ELLDRPLLRKRRRSSTVTEAPIAWTSRTSEVYVWGGGKSTPQKLDVIKSGCSARQVCAG 
NTHFAWTVEKELYTRVNMQGGTKLHGQLGHGDKASYRQPKHVEKLQGKAIRQVSCGDDF 
TVCVTDEGQLYAFGSDYYGCMGVDKVAGPEVLEPMQLNFFLSNPVEQVSCGDNHVWLTR 
NKEVYSWGCGEYGRLGLDSEEDYYTPQKVDVPKALIIVAVQCGCDGTFLLTQSGKVLACG 
LNEFNKLGIiNQCMSGIINHEAYHEVPYTTSFTLAKQLSFYKIRTIAPGKTHTAAIDERGR 

lltfgcnkcgqlgvgnykkrlginllggplggkqvirvscgdeftiaatdekvlnsktir 
snssglsigtvfqssspgggggggggeeedsqqesetpdpsggfrgtmeadrgmeglisp 
teamgnsngassscpgwlrkelenaefipmpdspsplsaafsesekdtlpyeelqglkva 
seaplehkpqveasvtelfafesqlvtsaescsnlcwegnttdsscvcvqlsaggg 

seq id no: 233_r86668_h, mkk6_h 

mnlllsyrdvqdysaiielvetlqalptcdvaeqhnvcfhytfalnrrnrpgdrakalsv 

LLPLVQLEGS VAPDLYCMCGRI YKDMFFS SGFQDAGHREQA YHWYRKAFDVE PSLHSGIN 
AAVLLI AAGQHFEDSKELRLIGMKLGCLLARKGCVEKMQYYWDVGFYLGAQ I LANDPTQV 
VLAAEQLYKLNAPIWYLVSVMETFLLYQHFRPTPEPPGGPPRRAHFWLHFLLQSCQPFKT 
ACAQGDQCLVLVLEMNKVLLPAKLEVRGTDPVSTVTLSLLEPETQDI PSSWTFPVAS I CG 
VSASKRDERCCFLYALPPAQDVQLCFPSVGHCQWFCGLIQAWVTNPDSTAPAEEAEGAGE 
MLEFDYEYTETGERLVLGKGTYGWYAGRDRHTRVRIAI KEI PERDSRFSQPLHEEIALH 
RRLRHKNIVRYLGSASQGGYLKIFMEEVPGGSLSSLLRSVWGPLKDNESTISFYTRQILQ 
GLGYLHDNH I VHRD I KGDNVLINTFSGLLKI SDFGTS KRLAG I TPCTETFTGTLQ YMAPE 
IIDQGPRGYGKAADIWSLGCTVIEMATGRPPFHELGSPQAAMFQVGMYKVHPPMPSSLSA 
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FIGURE 1AA 

EAQAFLLRTFEPDPRLRASAQTLLGDPFLQPGKRSRSPSSPRHAPRPSDAPSASPTPSAN 
STTQSQTFPCPQAPSQHPPSPPKRCLSYGGTSQLRVPEEPAAEEPASPEESSGLSLLHQE 
SKRRAMLAAVLEQELPALAENLHQEQKQEQGARLGRNHVEELLRCLGAH I HTPNRRQLAQ 
ELRALQGRLRAQGLGPALLHRPLFAFPDAVKQILRKRQIRPHWMFVLDSLLSRAVRAALG 
VLGPEVEKEAVSPRSEELSNEGDSQQSPGQQSPLPVEPEQGPAPLMVQLSLLRAETDRLR 
EILAGKEREYQALVQRALQRLNEEARTYVLAPEPPTALSTDQGLVQWLQELNVDSGTIQM 
LLNHSFTLHTLLTYATRDDLIYTRIRGGMVCRIWRAILAQRAGSTPVTSGP 

SEQ ID NO: 234_PAK6_H 

MFGKKKKKIEISGPSNFEHRVHTGFDPQEQKFTGLPQQWHSLLADTANRPKPMVDPSCIT 
PIQLAPMKTIVRGNKPCKETSINGLLEDFDNISVTRSNSLRKESPPTPDQGASSHGPGHA 
EENGFITFSQYSSESDTTADYTTEKYREKSLYGDDLDPYYRGSHAAKQNGHVMKMKHGEA 
YYSEVKPLKSDFARFSADYHSHLDSLSKPSEYSDLKWEYQRASSSSPLDYSFQFTPSRTA 
GTSGCSKESLAYSESEWGPSLDDYDRRPKSSYLNQTSPQPTMRQRSRSGSGLQEPMMPFG 
ASAFKTHPQGHSYNSYTYPRLSEPTMCIPKVDYDRAQMVLSPPLSGSDTYPRGPAKLPQS 
QSKSGYSSSSHQYPSGYHKATLYHHPSLQSSSQYISTASYLSSLSLSSSTYPPPSWGSSS 
DQQPSRVSHEQFRAALQLWSPGDPREYLANFIKIGEGSTGIVCIATEKHTGKQVAVKKM 
DLRKQQRRELLFNEWIMRDYHHDNWDMYSSYLVGDELWWMEFLEGGALTDIVTHTRM 
NEEQIATVCLSVLRALSYLHNQGVIHRDIKSDSILLTSDGRIKLSDFGFCAQVSKEVPKR 
KSLVGTPYWMAPEVISRLPYGTEVDIWSLGIMVIEMIDGEPPYFNEPPLQAMRRIRDSLP 
PRVKDLHKVSSVLRGFLDLMLVREPSQRATAQELLGHPFLKLAGPPSCI VPLMRQYRHH 

SEQ ID NO: 2 3 5_SURTK1 0 6_H 

MNDRNEI QMEAKLQSLTI IAQEI LCRFF I TLRRHARFLLTKLGRQGMARSG I THSCAVCI 
LCGPSREGDSPVAMGMTRMLLECSLSDKLCVIQEKQYEVI IVPTLLVTIFLILLGVILWL 
FIREQRTQQQRSGPQGIAPVPPPRDLSWEAGHGGNVALPLKETSVENFLGATTPALAKLQ 
VPREQLSEVLEQICSGSCGPIFRANMNTGDPSKPKSVILKALKEPAGLHEVQDFLGRIQF 
HQYLGKHKNLVQLEGCCTEKLPLYMVLEDVAQGDLLGFLWTCRRDVMTMDGLLYDLTEKQ 
VYHIGKQVLLALEFLQEKHLFHGDVAARNILMQSDLTAKLCGLGLAYEVYTRGAISSTQT 

I PLKWLAPERLLLRPAS I RADVWS FG I LLYEMVTLGAP PY PEVPPTS I LEHLQRRKI MKR 
PSSCTHTMYSIMKSCWRWREADRPSPRELRLRLEAAIKTADDEAVLQVPELWPELYAAV 

AGIRVESLFYNYSML 

SEQ ID NO: 236_AA098024_M 

LQEKHLFHGDVAARNI LI QSDLTPKLCHLGLAYEVHAHGAI SSARSSTI PLKWLAPERLL 
LRPASIRGDIWSFGILLYEMVTLGAPPYPEVPPTSILQYLQRKKIMKRPSSCSHAMYNIM 
KCCWRWSEDSRPLLVQLLQRLEAASRSADDKAVLQVPELWPELYADVAGIRAES I SYSF 

SVL 

SEQ ID NO: 237_SGK2ALPHA_H 

MNS S PAGTP S PQPSRANGN I NLGP SANPNAQPTDFDFLKV I GKGNYGKVLLAKRKSDGAF 
YAVKVLQKKS I LKKKEQS H I MAERS VLLKNVRHPFLVGLRYS FQTPEKL YF VLD YVNGGE 
LFFHLQRERRFLEPRARFYAAEVASAIGYLHSLNI IYRDLKPENILLDCQGHWLTDFGL 
CKEGVEPEDTTSTFCGTPEYLAPEVLRKEPYDRAVDWWCLGAVLYEMLHGLPPFYSQDVS 
QMYENILHQPLQIPGGRTVAACDLLQSLLHKDQRQRLGSKADFLEIKNHVFFSPINWDDL 
YHKRLTPPFNPNVTGPADLKHFDPEFTQEAVSKSIGCTPDTVASSSGASSAFLGFSYAPE 

DDDILDC 
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FIGURE IBB 

SEQ ID NO: 2 38_CCRK_H vin/Jvi.RRLEDGFPNQALREIKALQEMED 
MOTCILGRIGEGAHGIVFKAKHVETG^ 

NQYWQLKAVF PHGGG ^^^pX^^Qg^j^xHQVATR S VGC I MGELLNG S 

lilVHKDIJ*^ 

PLFPGKNDI EQLCYVLRI ^^^^Y^p^p^p^psgLPI PQRLGGPAPKAHPGP PHIH 
LDLLGQFLLYPPHQRIAAS^LHOTFTAPL^ 

SsqgShmqnddaclgaaspeccllvkekcre 

SEQ ID NO: 239JTESK2 H SQVG RVWPSSYRALISAFSRLTRLDDFT 

mdrskrnsiagfpprverleefegggg^^^ 

CEKIGSGFFSEVFIWRHRAS^^ 
STSGIGLQTQGKQDG 



a 
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FIGURE 2A 

SEQ ID NO: 1_X69117_H BARK2_H 

ATGGCGGACCTGGAGGCCGTGCTGGCCGATGTCAGTTACCTGATGGCCATGGAGAAGAGC 

AAGGCGACCCCGGCCGCCCGCGCCAGCAAGAGGATCGTCCTGCCGGAGCCCAGTATCCGG 

AGTGTGATGCAGAAGTACCTTGCAGAGAGAAATGAAATAACCTTTGACAAGATTTTCAAT 

CAGAAAATTGGTTTCTTGCTATTTAAAGATTTTTGTTTGAATGAAATTAATGAAGCTGTA 

CCTCAGGTGAAGTTTTATGAAGAGATAAAGGAATATGAAAAACTTGATAATGAGGAAGAC 

CGCCTTTGCAGAAGTCGACAAATTTATGATGCCTACATCATGAAGGAACTTCTTTCCTGT 

TCACATCCTTTCTCAAAGCAAGCTGTAGAACACGTACAAAGTCATTTATCCAAGAAACAA 

GTGACATCAACTCTTTTTCAGCCATACATAGAAGAAATTTGTGAAAGCCTTCGAGGTGAC 

ATTTTTCAAAAATTTATGGAAAGTGACAAGTTCACTAGATTTTGTCAGTGGAAAAACGTT 

GAATTAAATATCCATTTGACCATGAATGAGTTCAGTGTGCATAGGATTATTGGACGAGGA 

GGATTCGGGGAAGTTTATGGTTGCAGGAAAGCAGACACTGGAAAAATGTATGCAATGAAA 

TGCTTAGATAAGAAGAGGATCAAAATGAAACAAGGAGAAACATTAGCTTTAAATGAAAGA 

ATCATGTTGTCTCTTGTCAGCACAGGAGACTGTCCTTTCATTGTATGTATGACCTATGCC 

TTCCATACCCCAGATAAACTCTGCTTCATCCTGGATCTGATGAACGGGGGCGATTTGCAC 

TACCACCTTTCACAACACGGTGTGTTCTCTGAGAAGGAGATGCGGTTTTATGCCACTGAA 

ATCATTCTGGGTCTGGAACACGTGCACAATCGGTTTGTTGTCTACAGAGATTTGAAGCCA 

GCAAATATTCTCTTGGATGAACATGGACACGCAAGAATATCAGATCTTGGTCTTGCCTGC 

GATTTTTCCAAAAAGAAGCCTCATGCGAGTGTTGGCACCCATGGGTACATGGCTCCCGAG 

GTGCTGCAGAAGGGGACGGCCTATGACAGCAGTGCCGACTGGTTCTCCCTGGGCTGCATG 

CTTTTCAAACTTCTGAGAGGTCACAGCCCTTTCAGACAACATAAAACCAAAGACAAGCAT 

GAAATTGACCGAATGACACTCACCGTGAATGTGGAACTTCCAGACACCTTCTCTCCTGAA 

CTGAAGTCCCTTTTGGAGGGCTTGCTTCAGCGAGACGTTAGCAAGCGGCTGGGCTGTCAC 

GGAGGCGGCTCACAGGAAGTAAAAGAGCACAGCTTTTTCAAAGGTGTTGACTGGCAGCAT 

GTCTACTTACAAAAGTACCCACCACCCTTGATTCCTCCCCGGGGAGAAGTCAATGCTGCT 

GATGCCTTTGATATTGGCTCATTTGATGAAGAGGATACCAAAGGGATTAAGCTACTTGAT 

TGCGACCAAGAACTCTACAAGAACTTCCCTTTGGTCATCTCTGAACGCTGGCAGCAAGAA 

GTAACGGAAACAGTTTATGAAGCAGTAAATGCAGACACAGATAAAATCGAGGCCAGGAAG 

AGAGCTAAAAATAAGCAACTTGGCCACGAAGAAGATTACGCTCTGGGGAAGGACTGTATT 

ATGCACGGGTACATGCTGAAACTGGGAAACCCATTTCTGACTCAGTGGCAGCGTCGCTAT 

TTTTACCTCTTTCCAAATAGACTTGAATGGAGAGGAGAGGGAGAGTCCCGGCAAAATTTA 

CTGACAATGGAACAGATTCTCTCTGTGGAAGAAACTCAAATTAAAGACAAAAAATGCATT 

TTGTTCAGAATAAAAGGAGGGAAACAATTTGTCTTGCAATGTGAGAGTGATCCAGAGTTT 

GTGCAGTGGAAGAAAGAGTTGAACGAAACCTTCAAGGAGGCCCAGCGGCTATTGCGTCGT 

GCCCCGAAGTTCCTCAACAAACCTCGGTCAGGTACTGTGGAGCTCCCAAAGCCATCCCTC 

TGTCACAGGAACAGCAACGGCCTCTGA 
SEQ ID NO: 2 AA144574_M BARK2_M 

CTGCTTCGTAGTCTACAGAGACCTGAAGCCTGCGAACATCCTCCTAGATGAATATGGGCA 
CGTGAGGATATCGGATCTCGGCCTTGCCTGTGATTTCTCCAAAAAGAAGCCTCATGCCAG 
CGTGGGCACCCATGGGTACATGGCTCCCGAGGTGTTGCAGAAGGGAACGTGCTATGACAG 
CAGCGCCGACTGGTTCTCCCTGGGCTGTATGCTCTTCAAACTTCTGCGGGGCCACAGCCC 
CTTCAGGCAGCATAAAACCAAAGACAAGCATGAGATAGACCGAATGACCCTGACCGTGAA 
CGTGCAGCTTCCAGATGCCTTCTCCCCTGAGCTGAGGTCCCTCTTAGAGGGTTTGCTCCA 
GCGGGACGTGAGCCAGCGGCTGGGCTGCGGAGGAGGAGGGGCACGAGAGTTGAAGGAGCA 
CATCTTCTTCAAGGGCATTGACTGGCAGCATGTGTACTTACGGAAGTACCCGCCACCCCT 
AATCCCTCCTCGGGGAGAGGTCAACGCTGCAGATGCCTTCGATATCGGCTCCTTCGATGA 
GGAAGACACCAAAGGCATTAAGCTGTTGGACTGTGACCAGGACCTCTATAAGAACTTCCC 
ACTGGTGATCTCCGAGCGCTGGCAGCAAGAAGTGGTGGAGACCATCTATGACGCCGTCAA 
TGCTGATACTGATAAAATCGAGGCCAGGAAGAAGGCTAAAAATAAGCAACTTGGTCAAGA 
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FIGURE 2B 



GGAAGATTACGCTATGGGGAAGGACTGCATCATGCACGGGTACATGCTGAAGCTGGGGAA 
CCCCTTTCTCACACAGTGGCAAAGACGCTATTTTTACCTGTTCCCCAACAGACTGGAGTG 
GAGAGGAGAGGGCGAGTCTCGGCAAAGTCTACTGACCATGGAACAGATCATGTCTGTGGA 
GGAGACCCAGATTAAAGACAGAAAGTGCATCTTACTCAGGATAAAGGGAGGGAAGCAATT 
TGTCTTGCAATGTGAGAGTGACCCCGAGTTTGCACAGTGGCTGAAGGAGCTGACCTGCAC 
CTTCAATGAGGCCCAGAGACTGCTGCGCCGTGCCCCCAAATTCCTCAACAAACCACGGGC 
CGCCATCCTGGAGTTCTCCAAGCCACCACTGTGTCACAGAAATAGCAGCGGCCTCTGAAC 
CACAGAGCAGCGGGGCCTGAAGGAGGGGCCCCAGCTCTTCAGCCCAGGAGTGGAACGAAG 
CCACGGGGAACCGTGTGGGGCTAAGACACAGTGTTTCTGAGCACTGACGGGGCTGCTCCA 
AGCCGAGGAGGCTCAGGACACCAGGGCGGCCTTCTGGGAGCTGGGACATCCTCGGGGCTG 
TCCTATCCACACTCGAAATTACTGAAGAAGCAGAGGCATTCTGCTGTG 

SEQ ID NO: 3_AA826850_H 

GAAGAGGATGGGCTCGTCCATGTCGGCGGCCACCGCGCGGAGGCCGGTGTTTGACGACAA 

GGAGGACGTGAACTTCGACCACTTCCAGATCCTTCGGGCCATTGGGAAGGGCAGCTTTGG 

CAAGGTGTGCATTGTGCAGAAGCGGGACACGGAGAAGATGTACGCCATGAAGTACATGAA 

CAAGCAGCAGTGCATCGAGCGCGACGAGGTCCGCAACGTCTTCCGGGAGCTGGAGATCCT 

GCAGGAGATCGAGCACGTCTTCCTGGTGAACCTCTGGTACTCCTTCCAGGACGAGGAGGA 

CATGTTCATGGTCGTGGACCTGCTACTGGGCGGGGACCTGCGCTACCACCTGCAGCAGAA 

CGTGCAGTTCTCCGAGGACACGGTGAGGCTGTACATCTGCGAGATGGCACTGGCTCTGGA 

CTACCTGCGCGGCCAGCACATCATCCACAGAGATGTCAAGCCTGACAACATTCTCCTGGA 

TGAGAGAGGACATGCACACCTGACCGACTTCAACATTGCCACCATCATCAAGGACGGGGA 

GCGGGCGACGGCATTAGCAGGCACCAAGCCGTACATGGCTCCGGAGATCTTCCAXTCTTT 

TGTCAACGGCGGGACCGGCTACTCCTTCGAGGTGGACTGGTGGTCGGTGGGGGTGATGGC 

CTATGAGCTGCTGCGAGGATGGAGGCCCTATGACATCCACTCCAGCAACGCCGTGGAGTC 

CCTGGTGCAGCTGTTCAGCACCGTGAGCGTCCAGTATGTCCCCACGTGGTCCAAGGAGAT 

GGTGGCCTTGCTGCGGAAGCTCCTCACTGTGAACCCCGAGCACCGGCTCTCCAGCCTCCA 

GGACGTGCAGGCAGCCCCGGCGCTGGCCGGCGTGCTGTGGGACCACCTGAGCGAGAAGAG 

GGTGGAGCCGGGCTTCGTGCCCAACAAAGGCCGTCTGCACTGCGACCCCACCTTTGAGCT 

GGAGGAGATGATCCTGGAGTCCAGGCCCCTGCACAAGAAGAAGAAGCGCCTGGCCAAGAA 

CAAGTCCCGGGACAACAGCAGGGACAGCTCCCAGTCCGAGAATGACTATCTTCAAGACTG 

CCTCGATGCCATCCAGCAAGACTTCGTGATTTTTAACAGAGAAAAGCTGAAGAGGAGCCA 

GGACCTCCCGAGGGAGCCTCTCCCCGCCCCTGAGTCCAGGGATGCTGCGGAGCCTGTGGA 

GGACGAGGCGGAACGCTCCGCCCTGCCCATGTGCGGCCCCATTTGCCCCTCGGCCGGGAG 

CGGCTAGGCCGGGATGCCCGTGGTCCTCACCCCTTGAGCTGCTTTGGAGACTCGGCTGCC 

AGAGGGAGGGCCATGGGCCGAGGCCTGGCATTCACGTTCCCACCCAGCCTGGCTGGCGGT 

GCCCACAGTGCCCCGGACACATTTCACACCTCAGGCTCGTGGTGGTGCAGGGGACAAGAG 

GCTGTGGGTGCAGGGGACACCTGTGGAGGGCATTTCCCGTGGGCCCCCGAGACCCGCCTA 

GATGGAGGAAGCGCTGCTGGGCGCCCTCTTACCGCTCACGGGGAGCTGGGGCCATGGATG 

GGACAGGAGTCTTTGTCCCTGCTCAGCCCGGAGGCTGTGCACGGCCCTCGTCACAAGGTG 

ACCCTTGCAGCACAGGCCGCGGGTGCCCCAGGCTCGGCTCAGTTCTTGGAGGTCAAGGGC 

ATGGGTTGGGGTAGTGGGTGGGGAGGTGAATGTTTTCTAGAGATTCAAACTGCTCCAGCA 

ATTTCTGTATAGTTTTCACCTCTGAGAATTACAATGTGAGAACCGCTC 

SEQ ID NO: 4 AA960957_H 

GTCCCACATCCCGCATCCGGCATCCCAGCGGCCGGGCATGTAGCAGCGGCAGCAACGGCG 
GAATATGGGCGGGAACCACTCCCACAAGCCCCCCGTGTTTGACGAGAATGAGGAAGTCAA 
CTTTGACCATTTTCAGATTCTGCGGGCCATTGGTAAAGGGAGTTTTGGAAAGGTATGCAT 
CGTGCAGAAGCGAGACACTAAGAAAATGTATGCAATGAAGTACATGAACAAGCAGAAGTG 
CATCGAGAGGGATGAGGTTCGGAATGTTTTCCGGGAGCTGCAGATCATGCAAGGGCTGGA 
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-TGTGAAACTCTACATCTGTGAGCTr 



FIGURE 2C 

•ACTCCTTCCAGGATGAGGAGGACATGTTCATGGT 



ggtggacctgctcctgggaw^^u "~ gagtatcttcagag 

--ggact^^^ 



GTTCAAGGTGGAGCGTGTCCACTACTCCTCCAC 
GAGGAAGCTCCTGACCAAGGATCCTGAGAGCCGCGTGTC^ 

CGTGCCCTACTTGGC^^ 

CTTTGTGCCCAATAAAGGGAGGTTGAACTGCGATCCCAC^ 
TCTAGAATCCAAGCCACTTCACAAAAAGAAG^AAGCGATTGGCA^ 

TGGCACAAAGGACAGCTGCCCGCTGA^TGGACAC^ 

GGAGGAATTCATCATATTCAACAGAGAGAAGC^ 

GCTCTTGGACACCGACAGCCGAGGGGGAGGCCAGGCCC^ 

CAACAACAACCTCCTCACCCACACCTGCACCCG^ 

GGGCACTTCCGAAAAACACAGCCCTGALA^^^ aagTGTCTTCATGTGCATTCT 
CTATGGAAACAACGCTGCCAAATCCTGGAGCAAAACCTGAAG^ 

AGTTTCAGCAGTCCTCAC^CCACC^TATCCCC. GTGT AATCAGCACCCC 
TTCAGATC^GAGTTGGGTCGCTGAGCATTGGTTACTCCIG^ GCTTTC 

GCTGTCCATGTCATCTCTTAGATTTCTTAAAAGACA aacCGAGACTATTTC 

ATTTTTATTTTTTAAAAAAGAAAT^ 

TGGATTGTGTGCTCCTCGTCAGTTGACT^ 

TGGCATTTCTTGTTTGGAATAAACTATTTCTTGGACATTCCTTC 
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FIGURE 2D 

SEQ ID NO: 5_TBK1_H ArrrrGCCGGCG GTGGCGCGGCGGAGACCCGGCTG 
TTTTATCTGATATTTTAGGCCA^GGAGGTACT^ 

tI?ttgctattgaagaggagacaacaacaagacat^ 

CATGTGGGAGTTTATACACTGTTTTAGAAGAACCTTC^ 

CTGAATTCTTAATTGTTTTGCGAGA^ 
GTATAGTGCACCGTGATATCAAGCCAGGAAATATCA^ 



AGTCTGTGTACAAACTCACAGATTTTGGTG^ 
TTGTTTCTCTGTATGGCACAGAAGAATATTTGCACCCT^ 

TAAGAAAAGATCATCAGAAGA^TATGGAGCAA^ 

CATTTTACCATGCAGCTACTGGATCACTGCC^ 

ATAAAGAAGTGATGTATAAAATA^ 

CTCGGGGTCTTCAGGTTCTACTTACCC^ 

AAAAGTGTTGGGGTTTTGACCAGTTTTTTGCA^ 

TAATTCATGTTTTTTCGCTACAACAAAT^ 

ATACTGCTACTATATTTCATGAACTGGTATATAAA aagGCTGGCACAACATT 
AAGAACTTATCTACGAAGGGCGACGCTTA^GTCTTAG 

TCCCTAAAACTACTGAGGAAAACCCTA^ 
TAGGATTAATATATGAAAAAATTTCCC^ 

TGGAAGCGGCAGAGTTAGGTGA^TTTCAG^ 

G gI T /c=^ 

AAAAACTACAAGTCCTGTTAAATTGCATGA^ 
AAGCAGAACGTAGATTAGCTTATA^TGAAGAAC^ 

TGTATTACCATGCCACAAAAGCTA^ 
AGGCATTTTTGAATA^GTCAGAAG^ 
TATTATCGCTGACTAATCAGTGTTTTGATAl^ 

ATACTAATGAGTTACAAGAA^CTCTGCCTCAGAA^ TAGAAAT GACTCTTGGTA 

AACATACCATGACCCCAATTTATCCAAGTTCTAA^ 

TGAAGAAATTAAAGGAAGAGATGGAAGGGGTGGTTA^ 

ssss 




ATAMGTGGCATTATTTraTI^A^G^CC^GATCC ^^^c 
SSKSS S^TTAAAAXTCTAAGGTCATTTCAACT 
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GTTTTAAGCTGTATATTTCTTTAATTCTGCTTACTATTTCATGGAAAAAAATAAATTTCT 
CAATTTTAAAAAA 

SEQ ID NO: 6_AA30517 6_H 

TGGCTGCTCGCGGAGGGGCAGTGTACGCGGGGCCGCTGTAGGCTGTCCAGCGATGGATCC 
CACCGCGGGAAGCAAGAAGGAGCCTGGAGGAGGCGCGGCGACTGAGGAGGGCGTGAATAG 
GATCGCAGTGCCAAAACCGCCCTCCATTGAGGAATTCAGCATAGTGAAGCCCATTAGCCG 
GGGCGCCTTCGGGAAAGTGTATCTGGGGCAGAAAGGCGGCAAATTGTATGCAGTAAAGGT 
TGTTAAAAAAGCAGACATGATCAACAAAAATATGACTCATCAGGTCCAAGCTGAGAGAGA 
TGCACTGGCACTAAGCAAAAGCCCATTCATTGTCCATTTGTATTATTCACTGCAGTCTGC 
AAACAATGTCTACTTGGTAATGGAATATCTTATTGGGGGAGATGTCAAGTCTCTCCTACA 
TATATATGGTTATTTTGATGAAGAGATGGCTGTGAAATATATTTCTGAAGTAGCACTGGC 
TCTAGACTACCTTCACAGACATGGAATCATCCACAGGGACTTGAAACCGGACAATATGCT 
TATTTCTAATGAGGGTCATATTAAACTGACGGATTTTGGCCTTTCAAAAGTTACTTTGAA 
TAGAGATATTAATATGATGGATATCCTTACAACACCATCAATGGCAAAACCTAGACAAGA 
TTATTCAAGAACCCCAGGACAAGTGTTATCGCTTATCAGCTCGTTGGGATTTAACACACC 
AATTGCAGAAAAAAATCAAGACCCTGCAAACATCCTTTCAGCCTGTCTGTCTGAAACATC 
ACAGCTTTCTCAAGGACTCGTATGCCCTATGTCTGTAGATCAAAAGGACACTACGCCTTA 
TTCTAGCAAATTACTAAAATCATGTCTTGAAACAGTTGCCTCCAACCCAGGAATGCCTGT 
GAAGTGTCTAACTTCTAATTTACTCCAGTCTAGGAAAAGGCTGGCCACATCCAGTGCCAG 
TAGTCAATCCCACACCTTCATATCCAGTGTGGAATCAGAATGCCACAGCAGTCCCAAATG 
GGAAAAAGATTGCCAGGTTTGAGGGACATTTATCTTAATGAAAATCAATTATGTATGTCA 
AATGAATGTGAGAAATATTATACCTTTTCATATAAATTCCATAAAGAAATGAAATTGTTA 
CATGAATGGCAGTCATAGTATTAATCAGAAATTCATTTTCCTGCACATTCTGTCAAATTC 
TTTTGAAATATTTCATTTCTCATTCAATTGTGACATTGTTCTTACTTGATTATATAATGA 
GATTCTTGCAGTAAATTGATAATAAATGCTTGGCTTCTGTGTATCTAGGTGGACCTCACT 
TGTTTTTAGAAGTCCTTCCCATGATACAGACATTGGCTTGTTGGTTTTGTTTTATTTTGT 
TTTTAACATATGTCATTTAAAAACTCATATTACCTCCTTTT 

SEQ ID NO: 7_AA116 841_M 

CCACGCGTCCGATCCCATGGCCAGAAGGCGAAGAAAAGCTATCTGATAATGCTCAAAGTG 
CAATGGACATGCTTTTAACCATTGATGATTCAAAGAGAGCTGGAATGAGAGAACTAAAAC 
AGCATCCTCTCTTCAGTGAAGTGGACTGGGAAAATCTGCAGCATCAGACTATGCCTTTCG 
TACCCCAACCAGACGACGAAACAGATACATCCTATTTTGAAGCCAGAAATAATGCTCAAC 
ATCTGACCGTATCTGGGTTTAGTCTGTAGCACATGCGTGTCATTTTTATCTAACTTGTGA 
TATAGAATTAAGTTTTACAGTAATATGCTACTTAATACTAGATTGGTCTAAATGGGATAA 
AAGTCATTATTTTACCCAGACTGAACAGCTTTTAATTACTAAGTACAACAGTTTTTACAG 
AATT AAAATAC TATAAGC AATATAATC AG TAATT AATC TTT A C C TTAG AACTGT AT ATAA 
GCCATAATAGCTTTTTTCATCTTATTTATTCACTGCACTTTATGAAGAGCAAAGTATCAA 
TAAACTAAAACACTACCACTCTAAATAGAGGGAGTGAGCCGT 

SEQ ID NO: 8_AA256100_H 

AGGGAGCTGACGGGCGCCCGGCCGGCTGCGGTCCGTGCGGAGGCTGAGCCGGCCGCGGGC 
GCGACCGGAGGCAGTTTCCGTTACTATGGCAATGACGGCAGGGACTACAACAACCTTTCC 
TATGAGCAACCATACCCGGGAAAGAGTGACTGTAGCCAAGCTCACATTGGAGAATTTTTA 
TAGCAACCTAATTTTACAGCATGAAGAGAGAGAAACCAGGCAGAAGAAATTAGAAGTGGC 
CATGGAAGAAGAAGGATTAGCAGATGAAGAGAAAAAGTTACGTCGATCACAACACGCTCG 
CAAAGAAACAGAGTTCTTACGGCTCAAAAGGACCAGACTTGGCTTGGATGACTTTGAGTC 
TCTGAAAGTTATAGGAAGAGGAGCTTTTGGAGAGGTGCGGTTGGTCCAGAAGAAAGATAC 
AGGCCATATCTATGCAATGAAGATATTGAGAAAGTCTGATATGCTTGAAAAAGAGCAGGT 
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FIGURE 2F 

GGCCCATATCCGAGCAGAAAGAGATATTTTGGTAGAAGCAGATGGTGCCTGGGTGGTGAA 

GATGTTTTACAGTTTTCAGGATAAGAGGAATCTTTATCTAATCATGGAATTTCTCCCTGG 

AGGTGACATGATGACATTGCTAATGAAGAAAGACACCTTGACAGAAGAGGAAACACAGTT 

CTACATTTCAGAGACTGTTCTGGCAATAGATGCGATCCACCAGTTGGGTTTCATCCATCG 

GGATATTAAGCCAGACAACCTTTTATTGGATGCCAAGGGTCATGTAAAATTATCTGATTT 

TGGTTTATGTACGGGATTAAAGAAAGCTCACAGGACTGAATTTTATAGAAATCTCACACA 

CAACCCACCAAGTGACTTCTCATTTCAGAACATGAACTCAAAGAGGAAAGCAGAAACTTG 

GAAGAAGAACAGGAGACAACTGGCATATTCCACAGTTGGGACACCAGATTACATTGCTCC 

AGAAGTATTCATGCAGACTGGTTACAACAAATTGTGTGACTGGTGGTCTTTGGGAGTGAT 

TATGTATGAAATGCTAATAGGATATCCACCTTTCTGCTCTGAAACACCTCAAGAAACATA 

CAGAAAAGTGATGAACTGGAAAGAAACTCTGGTATTTCCTCCAGAGGTACCTATATCTGA 

GAAAGCCAAGGACTTAATTCTCAGATTTTGTATTGATTCTGAAAACAGAATTGGAAATAG 

TGGAGTAGAAGAAATAAAAGGTCATCCCTTTTTTGAAGGTGTCGACTGGGAGCACATAAG 

GGAAAGGCCAGCAGCAATCCCTATAGAAATCAAAAGCATTGATGATACTTCAAATTTTGA 

TGACTTCCCTGAATCTGATATTTTACAACCAGTGCCAAATACCACAGAACCGGACTACAA 

ATCCAAAGACTGGGTTTTTCTCAATTATACCTATAAAAGGTTTGAAGGGTTGACTCAACG 

TGGCTCTATCCCCACCTACATGAAAGCTGGGAAGTTATGAATGAAGATAACATTCACCCA 

TAACCAAGAGAACTCAGGTAGCTGCATCACCAGGCTTGCTTGGCGTAGATAACAATACAC 

TGAAATACTCCTGAAGATGGTGGTGCTTATTGACTACAAGAGGAAATTCTACAGGATTAG 

GATTTCTAAGACTACTATAGGAATTGGTTGGCAGTGCCAGCTGGCTCTTTTTTTTAATAT 

TTTATTATTTTTGTTAACTTTATTATATGAAGGTACTGGAATAAAAGGAACAGACATCCC 

TTTCTAACTGCACTGCCTACATGCGTATTAAGGTCCATTCTGCCTGTGTGTGCTGTGGCT 

TTGAACTGTAACACCTCTAATCAATTCAGGAGAAACACATATCATTTAAAGCAACATAGG 

CTAACCTGTAGGTAACACTGCAGTATTGATGTTTTACTGCAAATCTTATGGGTCTAGATA 

ATCAGTAAAAGCCATCTTCCATAGTTGGTGTTAGAACATTGCCCTATTGGTTTGGACATC 

TGTAGAATATATATGAAGACAATTTCTGTAATGGTTTTAAGAGATTTAAAAAGAAATTCA 

CTGGTTCTTTACAAAATAGAATTTATCATCAAGTTATTACACAAACTTCACAGTAAGGAG 

TGACAAGTTTATAATAAGGAAGACAAAGTTTAACACCTTCACTCAAGCACTCCACTAATA 

TATTTACGTTGCATTCAGAAATACTGATGACCTTCATATACGTAGTCTGTATACTCATAG 

GGAGATGTACTGTATTATATAACATGTAAAGTTGATTTTCTTGTGACAAGAGAACTTCTT 

TTTTTAACAAGAGGACATGGCATTATTTTAATTTGATTATGGTGAGTTGAATTTAAGACA 

TGACCATGAAGGCTGCTTGTAGAATTAGTGTATTTTTATTAAACTATTTTTTTAAATGTC 

AAACTTCTATCATGTAAATGGACTTATAGAGAACAAAAAGCTATTTACTTTGGTTTTCTA 

GAAAGTTGTTACATATCATGGCTGGTTAACTTTTATTTCTTTTGATGAAAATTTTTCCTT 

TGATAGTACTTGTATTATTGTGCCATTATTTTCTTATGCTCCAAATGTACCAAAGATCTT 

GAACAGAGTGGATGTTCACAACTGAGTAGAATTTTCCTTTCCTGTGGGCATGCTGTATTC 

AGACCTGACAGATCTTTGATAGAGGTCAGCTTATTAAAGGGCAATATTGTTCTTGTTTAG 

CTACATCACTGTGGTGAATATAGATGGAATTAAGGAAGTAAATGCAGGCCAGGGGGTTGT 

GATGAGAGGATAGGGGAGATAATATCAGCATCAAATTCTTTGGGTATCTCTCTAAGAATT 

AAATAATCTTTTCTAGCTTAATATTTTAATTCTAATTCAAACAACTCTGAGGTTTTGGTT 

TCATTAGTAATAGTTGAGGAATAATATACTAGCAAAGAATGGCCTAATGTTTGTCATAAC 

TGTTAATGGATGAAATTTTTTAAAGATACAACCATGATAACCATTATAAATGATCTATGA 

TCAAAATCTAAAGTGATGAATTATTTGTAGGAATGTCTTCCTAATGGGGAAGAATTGCAT 

AGGAGCATTATGCAAATCTACACAAGCTTTTATAAATGTTGCTGCTGGGTAGCTCCACAG 

TGTTTCATAAGGCCATCCTGTTTCCCCCAACTCCCCCATTTTTGGTTTGTTTCTTTTTAA 

ATATTTGTTGAGTACTTACGTGTTTATCTAACAGTTCACTTCCATTTTTCTAGTCTGGAT 

TTTTTGAGTATTTAGGAAAGAGAGCTATTAAAAACTCTGGGGATTTCTCAATGTGACTAA 

CTCTAATTTTTCTAATTATAACTGCCTTTAATTAACATAATATTAACTTTTGCTGAGGTT 

TATGAGATTTTCTCACCCCACATCGCTCCCCTTTTTTTAAAAAGGACTGTTTTGCTAGTG 

TGATAATGAATAGGTAAGATATGAGATAATTGCAACATTGTCCTAGTTCTAGTATGGTAA 
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FIGURE 2G 

CTATTCTTGAAATGGTATTGAAAAATACCGTTAATTCAAATTGACAGAGATTGATAAAAA 

GAAACTGATTTACCTAAGTTTACTTTTTAATTGCATAATAGAGCATTTTTTGTTTTGAGT 

TCCCTCATTCTTATTACCAGAAAGAGCTTGCAAATAGTTTTACTTTCTTGGCACTGGAAG 

GGTAGTTCTGGAAAGCTACTTTGTTGAGAGTCTCATTCTTCCCTGGAGTTAATAGAGTGA 

TTCACAATCTTTGGGGTTTTCTCCTCATCAAAAGCATTTCTTAAGTGCCTATCTAAAAGC 

AATTAAAGACTGTGTCTGCCCTTTAGAAGCTAAGAATTTGATTCATGATGCAAATTAACT 

AGATAATTTGCAAAGTACCCTTGAGATTGAATTTTCTCTATTATATATTTCCCATATTTC 

AGGTGAATAATTTAATTTAAATGACAAAACCCTATCTAGTCAACTGGGCATAATGACATT 

TTCTTTAAATTAGACTCTATTTTGAATTAAAAGAGTTTTATTATAAACCGTGTGTTTTTG 

GTTTTTCTAAGTATATAGAAAGCTTGTATAATTCAGATTTATCAATTTCCTGATTTAATG 

TAGACTTTGACTTTTTTATTAAAAACCTTTGTATTAAAGCAAGTTATGTTATTTTTCTTT 

TATGCATTTATTACTAACATAGCTTTAAATCTTTAAATGTATTGAAGCATTGTGCTGTCT 

GAAAATAAGGAATTGCTTATAAACCAGCCACTTCTGAATACAATATGTAGCTGATTTAAT 

AAGCTAGTTAGTGAATGGAAAATAAGTGTGGAGTATTAAAAATGTTCTTTGGTTGGTAAG 

GCCTAAGATAGGGTTTCATTTATTTCTATACTTTTTCTGTTTTTTAAACACCTGCATATT 

TTTATGTAAATCTCTAAATTTAAAATATTTTAAGTACATTTATTTTTGGTGTTTTATTGT 

ATAAAACCTTAGACAATCAATCAGTCAGTCTTTACTGACAGGAGCAGCAGCTATCTGTCT 

TTTGCTGATCTACAAATAAATGAATTGAGAATTTAGTCCATAGAGGTCCCTGGCTACCAA 

ACACATTCTCCTTTGAATTGTTAAAATTCAGAACATTCAAAATAACTGTTTTGCTACAAC 

CCATGATTATTTTCCTGTTGTGTTTATTTAAATTTACTTTCTCTTTAGAAGTGCACTTAT 

TTCTGAAAAATCTTAATGAAACAAACGCTTAGAACAAATATAAATATGAGACACTTGGGA 

CTACTAGAGATATTTTAGATTTTTATGAAAAAAATGTGAGGGGATATTGCTGCTTTAAAA 

AGGAATAAAGTAATAAAAATATATCTCAGCTATTTTTTTAAAGCAATATAATTCAGCAAT 

TGTCTAGAAAAGTAATCATGAGGCTACTGAGTTTGGTGTTCAGTTACTGAGTTTCAAAAA 

TGTTTTGGTGGCATGAGGACAAAATTTCATTGAAGGTAAGATAAGAATAAAAACTATGTT 

TAC 

SEO ID NO: 9 AA210825_H 

CACGAGGGCTACTGGCGCCTGGCGACCCTCCCTGCCCCCCACCCAACCCCGCTCCGGCAA 

CGCCCCCTTCCTCACGGCTCCCGACCGAACTTTTCTCCAACTTCTGCGACTCGTGAGATT 

CCCTTCTACCCACTCCGGCCCTCGGGACCCCTCTGCCCATCCCCTGGCCGGTCGGGTCCC 

TGCGAACCCCTTTATCTCTGGAATCCACTCGGTCCCCGACTCAGAGACTCCTGCCCTCCA 

CCCCCAAGGACCCCGCCATCCTCAGGTCCCCTCCGCCTGCCAGATCTTTTCTCGGATCCC 

CGCTCTCCCACCACCTGCTCACGAGATCCCGCGGATCTAGAACCCAGGGTCCCCCGGGGC 

CCCCCGGCGGGTCCCGGGTGGGCTCCAGGCGGGCGGTCCCCGGCCTCCCCCCATGGCCAC 

CGCCCCCTCATTATCCCGCCGGGCTCCCTGGCTCTCCCGGGCCGGGGTCTCCTCCGCCCC 

CCGGCGGCCTAGAGCTGCAGTCGCCGCCACCGCTACTGCCCCAGATCCCGGCCCCGGGTT 

CCGGGGTCTCCTTTCACATCCAGATCGGGCTGACCCGCGAGTTCGTGCTGTTGCCCGCCG 

CCTCCGAGCTGGCTCATGTGAAGCAGCTGGCCTGTTCCATCGTGGACCAGAAGTTCCCTG 

AGTGTGGCTTCTACGGCCTTTACGACAAGATCCTGCTTTTCAAACATGACCCCACGTCGG 

CCAACCTCCTGCAGCTGGTGCGCTCGTCCGGAGACATCCAGGAGGGCGACCTGGTGGAGG 

TGGTGCTGTCGGCCTCGGCCACCTTCGAGGACTTCCAGATCCGCCCGCACGCCCTCACGG 

TGCACTCCTATCGGGCGCCTGCCTTCTGTGATCACTGCGGGGAGATGCTCTTCGGCCTAG 

TGCGCCAGGGCCTCAAGTGCGATGGCTGCGGGCTGAACTACCACAAGCGCTGTGCCTTCA 

GCATCCCCAACAACTGTAGTGGGGCCCGCAAACGGCGCCTGTCATCCACGTCTCTGGCCA 

GTGGCCACTCGGTGCGCCTCGGCACCTCCGAGTCCCTGCCCTGCACGGCTGAAGAGCTGA 

GCCGTAGCACCACCGAACTCCTGCCTCGCCGTCCCCCGTCATCCTCTTCCTCCTCTTCTG 

CCTCATCGTATACGGGCCGCCCCATTGAGCTGGACAAGATGCTGCTCTCCAAGGTCAAGG 

TGCCGCACACCTTCCTCATCCACAGCTATACACGGCCCACCGTTTGCCAGGCTTGCAAGA 

AACTCCTCAAGGGCCTCTTCCGGCAGGGCCTGCAATGCAAAGACTGCAAGTTTAACTGTC 
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ACCCAC^CTGCTTTGAGATCGTCACT^ 

TTGACAAACTGCGCTTCCCTACCAAGCAGGAGAGC^ 

TGCAGAGCCTGCGGCATCCCGGGATCGTGAACCT^^ 

AAGTGTTTGTGGTGATGGAGAAGCTGCATGGG^ 

AGAAGGGCCGGCTGCCTGAGCGCCTCACCAAGTTCCT 

TGAGACACCTTCACTTCAAGAACATTGTCCACTGTG 

TGGCATCAGCAGACCCATTTCCTCAGGTGAAGCTGTGTG^ 

TCGGCGAGAAGTCGTTCCGCCGCTCAGTGGTGGG^ 

TCAGCCACCCCTGGTTACAGGAGTACCAGACGTGGCTG^ 
CCATGGCCTTGATCTCAGCAAAA 

SEO ID NO: 10 - AA12729 ^^»r !1 TTTrcTTGCTATGGATTCAAATGGTCTTTCT 
GAATTTGCAGGAAAA 

SEQ ID NO: 11_AA316804 H GTCTGTATTACC CACAGCTATTCCT 
ATGTCTGCAAATAATTCCCCTCCATCAGCCCA^ 
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FIGURE 21 

CCAAATAACTGTAGTGGAGTAAGAAAGAGACGTCTGTC^TGTA^ 
GGCCTCTCAGTTCCAAGACCCCTACAGCCTGAATATGTA 

CATGTCCACCAOSAACC^^ 
GAAAAGATGGTAATGTGCAGAGTGAAAGTTCC 

CGTCCCACGATATGTCAGTACTGCAAGCGGTTACTGJ^ 

CAGTGTAAAGATTGCAAATTCAACTGCCAT^ 

TGCCTTGGAGAGGTTACTTTCAATGGAGAACCTTC 

CCAATGGATATTGACAATAATGAC^ATAAATAGTGATAGTAGT 

GAAGAGCCATCACCCCCAGAAGA^ 

GAAAGAGATGAAGAAGCCGTTAAAACAAi'-^ GCAG CACAATGGTGAAGGAA 

CAGAGCTGGGAAAAAGCAATTCGCCAAGCCCTCATGCCTGTTAC 
TGCACTTCTCCAGGGCAAGGGAAAGAT^^ 

ATGTTTGAAACCCCAGAAC^GTCTTTGTAGT^TGGAAW^TG ^^^^^^^^ 

GCATACTTAGCCCCTGAAGTTCTCCGGAGCAAAGGTTAC 

TCAGTGGGAGTTATCATCTATGTGAGCCTCAG^ 
GATATAAATGACCAAATCCAAAATGCTGCA^TATGTACC 

CCTAATCCAGATGATATGGAAGAAGATCCTTAA 
ATGGAGGAGGGGGCGCK^CGGCAG^ 

CAGCTCAGGGCTCGGCACCTAGAGGCTCTCCGGAGGC^A 

AAACAGGGGGCTGAGAACATGACCCACACGTGCGCC^ 

CTCCTTGCAGCTGCCCAGCAGATGCT^ 
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FIGURE 2J 

GTGAAACTGCTTAGTAGCCGGAG^CACAGGACC^ 
CAGCTACAGGAGTCCTCTCAGAAACTGGACCTCCTGCGCCTGGCCT^ 

GAGCAACTGCCTCCTGCCCACCCTTTGCGCAGCAGACTG^ 

GTGCCTGGATACCCCCAGCCTTCAGGGACACCTGTGAAGCC^ 

CTGCAGGTCCGCCTCCTGGGCTGTGAACAGTTGCTGACAGCCOT 

GCGGCCGCACTGGCCAGCAGCCCCTCCGAGGGCTGGCT^ 

CGTGGCCGAGGCGAGCTTGCCAGTGAGGTGCTGGCTGTGC1AAA TTTGTC ATC 

GTGGGGCAGACGGGCTGGGGGCAGGTGGCCGAACAGTCCTC 

CCACTGGAGCGAGCCCGTGAGCTGGAGATTGGGGTACACTC 

TGTGGCGTGGCCTTCCTGAGACTTC 

CTCAGCCTGGTACCGCAGGGACTGCTIiii TCTCTAAAC GCAGAGGCCAGGAC 
ACAGGGAAATAGTAGGGGATCAAAGCA^^ 

ATTTACAGGGACCTGAAGTTGGATMCCTTCTGCTGGATGCC CACCXTCT GT 
GCAGACTTTGGACTCTGCAAGGAAGGGATCGGCTTCGGGGACC^ 

GGCACCCCGGAGTTCCTGGCTCCCGAGG^ 

GACTGGTGGGCGCT^GTCTCCTCCTCTA^GAGATGC^^^^^^^^^^^^^^ 
CGCCTGGGGGCAGGTGAGCAGGATGCCGAGGAGAT^ 

ACCAACTGGCAAGCCCTGCTCGCCCGCACCATCCAGCCCCCL CTGCCC TGACC 
TTTGTGTCAGAGCGATTCCTGGAACCCTGA 

SEQ ID NO: 13_AI021023_M P ^^ r _ ATTCCTGAAGATCG CAGACTTTGG 

GCGATTCCTGGAACCCTGAGGGCATCTC^ 
TTCTGAGCCCTTGGGATTCAAAGT^CAGCCATGG 

:>a/ir> 
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FIGURE 2K 

ACTTGCTTTATATTAAATTTGTAAAAGTG 

GGTGGCAACATCCGGGGTCCCTGGGCCCGAGGCTGGAAGAGCCTCTGGACAGGTT^ 
AcS?CA^G?CAGAT^ 

rRPTTCATCAACCTCTTTCTACCAGAGTTTCCCATTAGGCCCATTAGGGGGCAGCAGCAG 

SgIIgaS™ggcctcgtggctaaaggct 

TGCACCCAGAAAGCTGTATTTGCAGTGAAGGTGGTGCCCAAGGTAAAGGTCCTACAGAGG 

gSaccSga^cagtgcaaagaggaggttagca 
gtIcacagS?g^ggacagct^ 

?gcagcacagatctgtactccctttggtcggctgttggctgctttcctgaggcttccatc 
^^^Sgctgccgagttggtgctggtactgtgttatctccatgacttgggcatcatg 

catcgagSgtgaagatggagaatattct^ 

racTTTG^TCTGTCCCGCCACGTGCCCCAGGGAGCTCAAGCCTACACTATCTGTGGCACT 

?gg?c?c?ggg?otcttgcttttctctct 

^^rAGGGCCTCTCACTCCTGCTCCATGAGCTCTTATGCCAGAACCCCCTCCATCGT 

c?Ic^™???gS5ca?Sccaggtccacc 
gagSSacI^ 

?CAGCGG^^ 

cctatccctgcttga 

Sgagaaaaattctatagtctaaaatcagaacctttgaaaccattctttactcttccag^ 
ggaga^gcttctaggagttttaatactagtgaaagcaaggtagagtt^^ 

GACACCA?^ 

?t?gaSSgtcagtggtactgatgaagg^ 

gaISggIS?^ 
™gggatSt^aaaataaactc^ 

actgaa^aa^gccaagcacatgaggagaaaggcatagaggaactgagtgatccctctggg 
c^caaI^a^ 

Sag^gctStgatcatagtagttcaggagatatg^ 

^ATTCCOTATTTGTAGTCCACTCTCAGGTGCTAATGAATATATTGCAAGCACA^ 

SaaaaISgaagaagtattgctgtttacagatcagactgatgatttggctaaagaggaa 

AC^GGAAATACATCAGATTTTTGAGGACCTTGATAAAAAATTAGCACTAGCCTCC 

agg??™ca^cagagS 

GATGCTTTACATAGAGAGGGAATTGTGTGCCGCGATTTGA^ 

aa^SIgaggacacattcagct^ 

^gacagcgSg^^ 

gaaIctgI^ 

aa^Stccttgaatgccatccagcaggaataaatactcacactactttgaa 
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GAACGACTTGGTGCTGGAGTTGCTGGTGTTGAAGATATCAAATCTCATCCATTTTTTACC 
CCTGTGGATTGGGCAGAACTGATGAGATGA 

SEQ ID NO: 16_AA626690_H 

ATGCTACCATTCGCTCCTCAGGACGAGCCCTGGGACCGAGAAATGGAAGTGTTCAGCGGC 

GGCGGCGCGAGCAGCGGCGAGGTAAATGGTCTTAAAATGGTTGATGAGCCAATGGAAGAG 

GGAGAAGCAGATTCTTGTCATGATGAAGGAGTTGTTAAAGAAATCCCTATTACTCATCAT 

GTTAAGGAAGGCTATGAGAAAGCAGATCCTGCACAGTTTGAGTTGCTCAAGGTTCTTGGT 

CAGGGGTCATTTGGAAAGGTTTTTCTTGTTAGAAAGAAGACCGGTCCTGATGCTGGGCAG 

CTCTATGCAATGAAGGTGTTAAAAAAAGCCTCTTTAAAAGTTCGAGACAGAGTTCGGACA 

AAGATGGAGAGGGATATACTGGTGGAAGTAAATCATCCATTTATTGTCAAATTGCACTAT 

GCCTTTCAGACTGAAGGGAAACTGTACTTAATACTGGATTTTCTCAGGGGAGGAGATGTT 

TTCACAAGATTATCCAAAGAGGTTCTGTTTACAGAGGAAGATGTGAAATTCTACCTCGCA 

GAACTGGCCCTTGCTTTGGATCATCTGCACCAATTAGGAATTGTTTATAGAGACCTGAAG 

CCAGAAAACATTTTGCTTGATGAAATAGGACATATCAAATTAACAGATTTTGGACTCAGC 

AAGGAGTCAGTAGATCAAGAAAAGAAGGCTTACTCATTTTGTGGTACAGTAGAGTATATG 

GCTCCTGAAGTAGTAAATAGGAGAGGCCATTCCCAGAGTGCTGATTGGTGGTCATATGGT 

GTTCTTATGTTTGAAATGCTTACTGGTACTCTGCCATTTCAAGGTAAAGACAGAAATGAG 

ACCATGAATATGATATTAAAAGCAAAACTTGGAATGCCTCAATTTCTTAGTGCTGAAGCA 

CAAAGTCTTCTAAGGATGTTATTCAAAAGGAATCCAGCAAATAGATTGGGATCAGAAGGA 

GTTGAAGAAATCAAAAGACATCTGTTTTTTGCAAATATTGACTGGGATAAATTATATAAA 

AGAGAAGTTCAACCTCCTTTCAAACCTGCTTCTGGAAAACCAGATGATACTTTTTGTTTT 

GATCCTGAATTTACTGCAAAAACACCTAAAGATTCTCCCGGTTTGCCAGCCAGTGCAAAT 

GCTCATCAGCTCTTCAAAGGATTCAGCTTTGTTGCAACTTCTATTGCAGAAGAATATAAA 

ATCACTCCTATCACAAGTGCAAATGTATTACCAATTGTTCAGATAAATGGAAATGCTGCA 

CAATTTGGTGAAGTATATGAATTGAAGGAGGATATTGGTGTTGGCTCCTACTCTGTTTGC 

AAGCGATGCATACATGCAACTACCAACATGGAATTTGCAGTGAAGATCATTGACAAAAGT 

AAGCGAGACCCTTCAGAAGAGATTGAAATATTGATGCGCTATGGACAACATCCCAACATT 

ATTACTTTGAAGGATGTCTTTGATGATGGTAGATATGTTTACCTTGTTACGGATTTAATG 

AAAGGAGGAGAGTTACTTGACCGTATTCTCAAACAAAAATGTTTCTCGGAACGGGAGGCT 

AGTGATATACTATATGTAATAAGTAAGACAGTTGACTATCTTCATTGTCAAGGAGTTGTT 

CATCGTGATCTTAAACCTAGTAATATTTTATACATGGATGAATCAGCCAGTGCAGATTCA 

ATCAGGATATGTGATTTTGGGTTTGCAAAACAACTTCGAGGAGAAAATGGACTTCTCTTA 

ACTCCATGCTACACTGCAAACTTTGTTGCACCTGAGGTTCTTATGCAACAGGGATATGAT 

GCTGCTTGTGATATCTGGAGTTTAGGAGTCCTTTTTTACACAATGTTGGCTGGCTACACT 

CCATTTGCTAATGGCCCCAATGATACTCCTGAAGAGATACTGCTGCGTATAGGCAATGGA 

AAATTCTCTTTGAGTGGTGGAAACTGGGACAATATTTCAGACGGAGCAAAGGATTTGCTT 

TCCCATATGCTTCATATGGACCCACATCAGCGGTATACTGCTGAACAAATATTAAAGCAC 

TCATGGATAACTCACAGAGACCAGTTGCCAAATGATCAGCCAAAGAGAAATGATGTGTCA 

CATGTTGTTAAGGGAGCAATGGTTGCAACATACTCTGCCCTGACTCACAAGACCTTTCAA 

CCAGTCCTAGAGCCTGTAGCTGCTTCAAGCTTAGCCCAGCGACGGAGCATGAAAAAGCGA 

ACATCAACTGGCCTGTAA 

SEO ID NO: 17 AA215680_H 

ATGAGCCTGGTGGCCTGTGAGTGCCTGCCCAGCCCCGGCCTGGAGCCTGAGCCTTGCTCA 
CGAGCACGGTCCCAAGCTCACGTGTACCTGGAGCAGATTCGCAACAGGGTGGCTCTGGGA 
GTGCCTGACATGACAAAACGTGACTATCTGGTGGATGCGGCCACGCAGATCCGGCTGGCC 
CTGGAGCGCGATGTTAGTGAGGACTATGAGGCGGCCTTCAACCACTATCAGAATGGCGTG 
GACGTGCTGCTCCGTGGCATACACGTTGACCCCAACAAGGAGCGACGTGAGGCTGTGAAG 
CTGAAAATTACCAAATACCTGCGGCGGGCAGAGGAGATCTTCAACTGCCACCTGCAGCGG 
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CCGCTGAGCAGTGGAGCCAGCCCCAGCGCGGGTTTCAGCAGCCTGAGGCTCCGGCCCATT 

CGCACGCTGAGCTCTGCCGTGGAGCAGCTGAGGGGCTGCAGGGTGGTCGGGGTCATCGAG 

AAGGTGCAGCTGGTCCAGGACCCGGCAACCGGAGGGACCTTTGTGGTGAAGAGCCTACCC 

AGGTGCCACATGGTGAGCAGGGAGCGGCTGACCATCATCCCACACGGAGTCCCCTACATG 

ACGAAGCTGCTCAGGTACTTTGTGAGCGAGGACTCCATCTTCCTGCACCTGGAGCATGTG 

CAAGGAGGCACTCTCTGGTCCCACCTGCTCTCCCAGGCGGACTCCCGACATTCTGGGCTC 

AGCTCTGGCTCTACCCAGGAGAGGATGAAGGCTCAGCTCAACCCCCACCTCAACCTCCTG 

ACCCCAGCGAGGCTTCCCTCAGGCCATGCCCCTGGCCAGGACAGAATCGCCCTGGAGCCT 

CCTAGGACTTCTCCGAACCTTCTCCTAGCTGGGGAGGCCCCATCCACCAGACCCCAGAGG 

GAGGCTGAAGGTGAACCCACAGCCAGGACCAGCACCTCTGGCTCCTCGGACCTTCCAAAG 

GCCCCAGGTGGCCACCTGCACCTTCAAGCTAGGAGGGCTGGCCAGAACTCAGACGCTGGG 

CCCCCTCGGGGGCTCACTTGGGTTCCTGAGGGGGCCGGCCCGGTGCTAGGGGGCTGTGGC 

CGAGGCATGGATCAGAGCTGCCTGTCAGCAGATGGGGCCGGCCGGGGCTGTGGCAGGGCC 

ACCTGGAGTGTGAGAGAGGAGCAGGTGAAGCAGTGGGCGGCAGAGATGCTGGTAGCGCTG 

GAGGCGCTGCACGAGCAGGGGGTGCTGTGCCGGGACCTCCACCCCGGGAACCTGCTCCTG 

GACCAGGCAGGTCACATCCGGCTCACATATTTTGGCCAGTGGTCAGAGGTGGAGCCCCAG 

TOCTGCGGGGAGGCCGTGGACAATCTCTACAGCGCCCCAGAGGTGGGTGGGATTTCCGAG 

CTGACGGAAGCCTGTGACTGGTGGAGCTTTGGGTCTCTACTGTATGAACTGCTGACGGGA 

ATGGCACTGTCCCAGAGCCACCCTTCAGGAATCCAGGCCCACACCCAGCTCCAGCTGCCC 

GAGTGGCTCAGTCGCCCAGCGGCCTCTCTGCTGACTGAGCTGCTGCAGTTCGAGCCTACC 

CGGCGCCTGGGCATGGGAGAAGGTGGTGTCAGCAAACTCAAGTCCCATCCCTTTTTCAGT 

ACCATCCAATGGAGCAAGCTGGTGGGGTAA 
9FO ID NO: 18 SGK H 

ATGACGGTGAAAACTGA5GCTGCTAAGGGCACCCTCACTTACTCCAGGATGAGGGGCATG 

GTGGCAATTCTCATCGCTTTCATGAAGCAGAGGAGGATGGGTCTGAACGACTTTATTCAG 

AAGATTGCCAATAACTCCTATGCATGCAAACACCCTGAAGTTCAGTCCATCTTGAAGATC 

TCCCAACCTCAGGAGCCTGAGCTTATGAATGCCAACCCTTCTCCTCCACCAAGTCCTTCT 

CAGCAAATCAACCTTGGCCCGTCGTCCAATCCTCATGCTAAACCATCTGACTTTCACTTC 

TTGAAAGTGATCGGAAAGGGCAGTTTTGGAAAGGTTCTTCTAGCAAGACACAAGGCAGAA 

GAAGTGTTCTATGCAGTCAAAGTTTTACAGAAGAAAGCAATCCTGAAAAAGAAAGAGGAG 

AAGCATATTATGTCGGAGCGGAATGTTCTGTTGAAGAATGTGAAGCACCCTTTCCTGGTG 

rrCCTTCACTTCTCTTTCCAGACTGCTGACAAATTGTACTTTGTCCTAGACTACATTAAT 

GGTGGAGAGTTGTTCTACCATCTCCAGAGGGAACGCTGCTTCGTGGAACCACGGGCTCGT 

TTCTATGCTGCTGAAATAGCCAGTGCCTTGGGCTACCTGCATTCACTGAACATCGTTTAT 

AGAGACTTAAAACCAGAGAATATTTTGCTAGATTCACAGGGACACATTGTCCTTACTGAT 

TTCGGACTCTGCAAGGAGAACATTGAACACAACAGCACAACATCCACCTTCTGTGGCACG 

CCGGAGTATCTCGCACCTGAGGTGCTTCATAAGCAGCCTTATGACAGGACTGTGGACTGG 

TGGTGCCTGGGAGCTGTCTTGTATGAGATGCTGTATGGCCTGCCGCCTTTTTATAGCCGA 

AACACAGCTGAAATGTACGACAACATTCTGAACAAGCCTCTCCAGCTGAAACCAAATATT 

ACAAATTCCGCAAGACACCTCCTGGAGGGCCTCCTGCAGAAGGACAGGACAAAGCGGCTC 

GGGGCCAAGGATGACTTCATGGAGATTAAGAGTCATGTCTTCTTCTCCTTAATTAACTGG 

GATGATCTCATTAATAAGAAGATTACTCCCCCTTTTAACCCAAATGTGAGTGGGCCCAAC 

GAGCTACGGCACTTTGACCCCGAGTTTACCGAAGAGCCTGTCCCCAACTCCATTGGCAAG 

TCCCCTGACAGCGTCCTCGTCACAGCCAGCGTCAAGGAAGCTGCCGAGGCTTTCCTAGGC 

TTTTCCTATGCGCCTCCCACGGACTCTTTCCTCTGA 
qFO ID NO- 19 AA107515 M 

rrGGTCGACCCACGCGTCCGCCGGTTTCACTGCTCCCCTCAGTCTCTTTTGGGCTCTTTC 
CGGGCATCGGGACGATGACCGTCAAAGCCGAGGCTGCTCGAAGCACCCTTACCTACTCCA 



Wl II) 



PCTAJS00/14842 

WO 00/73469 

FIGURE 2N 

GAATGAGGGGAATGGTAGCGATTCTCATCGCTT^ 

ACGATTTTATTCAGAAGATTGCCAGCAACACCT^ 
CCATTTTGAAAATGTCCCATCCTCAGGAGCCGGAGCTTATGAA 
CGCCAAGTCCCTCTCAACAAATCAACCTGGGTCCGTCCTCCAAC^ 
CCGACTTTCACTTCTTGAAAGTGATCGGAAAGGGC^AGTTTTGGAA 

GGCACAAGGCAGAAGAAGTATTCTATGC^ 
AGAAGAAAGAGGAGAAGCATATTATGTCAGAGCGGAATGT^ 

ACCCTTTCCTGGTGGGCCTTCACTTCTCATTCCA^G 

TGGACTACATTAATGGTGGAGAGCTGTTCTACCATCi TCTGCAC TCCC 

AACCACGGGCTCGATTCTACGCAGCTGA^TAGCCAGT^ 

TAAACATCGTTTATAGAGACTTAAAACCTGAG^ATATTCTC 

TCGTCCTCACTGACNTATTTCAGCTGCGTAGAATCGAGCAT 

TCTGTGGCACGCCTGAGTATCTGGCTCCTGAGGTCCTCCATAAO 

CGGTGGACTGGTGGTGTCTTGGGGCTGTCCTG^TGAGATG 

TTTATAGCCGGAACACGGCTGAGATGTACGACAATATTC 

AACCAAATATTACAAACTCGGCAAG^GCACCTCOT 

CCAAGAGGCTGGGTGCCAAGGATGACTTTA^G^Ai^^^ cccATTT ^ ccd ^ TGTGA 



GTGGGCCCAGTGACCTTCU^ia^^^^---^ aagGAAGCAGCAGAAG 
CCATCGGCAGGTCCCCTGACAGCATCCTTGTCACGGCCA^ 

CCTTCCTCGGCTTCTCCTATGCACCTCCTGTCGATTC^ 

====== 

CAATTTTGGTTTGCAATAAAATCTTGAAAACT 

W"3 



PCT/US00/14842 

WO 00/73469 

FIGURE 20 

GAGGCTAACTCCACCCTTCAACCCAAATGTGACAG^ 

cccaSotcacccaggaagctgtgtc^ 

CAGCAGCTCTGGGGCCTCAAGTGCA^ 

PMCTTGGATTGCTAGAAGAGAAGGACCTGTGAAACTAUi TCAACGAG AAG 

agga!t?accttcagctgctaggaagagcgactcaa^c^ 
aagctccacccaatgagttctgctt™^ 



aSgtatcaatatctcttat-tgttct 



S^gga^S 

AC^GGAAAGCTGCCCAAGTGTAAGNATTCCCAGCTCC^ATG 
CTGTCAAAGTGTTACAGAAAAAAATAGTTCTCAACA^ TGGAXTGCA TTATT 

tSaatca^cagctgggctgacc^acaa^gaa^ 
tcI?taWttatattcttgtttaataag^ 

ko/ii) 



PCTAJSOO/14842 

WO 00/73469 

FIGURE 2P 

ggagttggagaccagcctga^ 

^^^^^^^^^^ 

CCTGGGCAACAAGAGTGAAACTCCATCTCC 



SEQ ID BO: 22_R47805 H GAGCTGRRGGA ATTCTTTGCCAAGGCR 
ATGGCGCACCAAACGGGCMCCACGCCRCGGMGAG^ GCAGCTCGTG CTGGGT 

GCCTCGCAGGAGCCAGTACMrcGCTGGGATCA^ CTCACAGARTGC TCAGGGG 
CTGGACGCGCAGCAGCCCTGCTACCTGCTCTAC^ 

TTCGAATGGCTCTTCCT^CTGGTCGCC^AA^ CCACATCAAGG ATGAG 



tacgcggccacgcggg™tgaa~»^ ; ACC AGAAACACCTGTCGTGC 

CTCTTCGGGACTGTGAAGGATGACCTCTCTTTI^IU AGGAGAT CGGCATTAAC 

CCCCTGGAGGCTGAGGCCCAGCGGGCACTCCAGC* CTGGTGCACAC AGAGCCCACG 
ATCCAGATGAAGCTGGACCTAGAGCGGGAAACCA^TGAGCTGG^ 

GATGTGGCCCAGCTGCCCTCCCGGGTGCCCDGAGA GTGTTCATC TACTCCATGCCG 
TACAAGCACACCCATGAGGGCGACCC^TTGAOT^^^^^^^^^^^^^^q Gg ^, gg ^. g 

GACTCCGTGGAGCAGGACTTCCATCTGGAGATCGGCAAGAAAM TTCAAG 
GCAGAGCTGACGGCAGAGTTCCTCTACGACGAGGTGCAC^ 



ScCCGGGTGAAAATGGGGATGACAGCTAG 

SEQ ID NO: 23_H60215_H rr r GGC GGCGGCGGCGGCGGCGGCTCGGG 

ccLgcgtccggcgccgcagccatggagggag^ 

TGGCTGCGCTGGGAGGCGGCGGTGAGAGGCT 
CGGGAGGGAGAGCCGAGCAGCCCCG^ 

ccacccggcgaagtgcacaca^ 

ACAATATATGTGCTCTGCTCTCCT^ 
CACCAGGTCT(^TTCCAGACTCCTCCCCAC^ 
.ACCACAGACCCATTCAGGGAGGCTGGCG^ 
TGTCAAAGCTGAGAATGAAGCGGAGAG^ 



GGCCAAGGCTCTAGGAAGTGGGATTTCTGGA^ 
TCCTTGGTCCCCGTCTGGGCAACTCACCG^TGCCAAGCA^ 

AAGATGGCACGGATGACTTCTATCAGCTG^GATCC^ 

TOTTCMCCTGCAGCACTACGTCATCAAG^GAAGAGGCTCAGCG^^^^^^^^^ 
<»GACCTGAAGCTGGG<MCAT^TCCT^CAAQ^ CTGAAGGA cCAGAGAGGGA 

U«/ /I) 
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FIGURE 2Q 

AGTCCCTGTCATCTCTGAGTGGGCCTTTGCAAGTGGTTCCTGACATTGATGACCAAATGA 
GGAGCTGGGTACCCAAGCGGCAGTTCGGCAGCGCACCACCGGTGCGACGGCTGGGCCACG 

acg?acag?™^ 

AACAGCCTCAGCCGGGGCCACCAGCACTGCTGCCACTTCTTCCAGCCCCAGCCAAAGGCG 
TGGCTGTCAGGGCTGGGCCCTGTAGTGCTGGACTCTCCCGGGCCACAATAGGGACAGGG^ 
ArrGACAGGGACAGCCCAGGTCACACGTGGGGTCAGCAGAGGTACCACGAAGCTACCTTT 

tcggaSSgctcgaS 

TTTA^TCACGATGTTTTAATCTACCTCTGTCTCTTTAACCATGCTGTCTCTGGACTGA^ 

aIgSaggLggagcctgctcaccccactccagggccttccccagcggccac^ 

ATCTGGGGCGCTGCTCCCCACAGTCCAAATAAGCTGAAAGTGCAGCTCGCTGCAGGCCCC 
AGAGCGAGCTTCCCCTCCTCCCTGCTCTCCCAGGCCCCTG 

^ScSctgSccLgcccctcagtccaagctttggaaaaccttcac 

CAAACTCAAATATATTTATTTTTTTACCAT 
rrcGCGATGGCCAGCACCA^ 

?cg?ggccgggg??gcg 

cccSg^gaacgggctcatccccagtccggcgcacagtgcccactgcagcttctaccgc 
acgc^gaccctgcaggccctcagctcggagaagaaggccaagaaggcgcgcttctaccg^ 
^ggggaccgctacttcaagggcctggtgtttgccatctccagcgaccgct 
^cga?g^c??c?catagagctcacccgctccctgtcggacaacgtg 

gg?g??2g?aSatctac^ 
??ggSggSagagtt^ 

aa^aaa^attaatccaaactggtctgtgaacatcaagggtgggacatcccgag 

g^ctcctogtgaaaagtgaagt^ 
SgaSgtgIScgaagtggagtgaag 

SgaSgctcattcctttgaacaagtcttaacagatatcaccgaagcca^ 
™gIgtcg?caa^ggctctgcaccctggatggaa^ 

CgSSgCcS^GATGAC^ 

ISccagSttctgctcatggcagatcttcttccaatgtaaacggtggacc 
cgSSStaagtcctgaaggtgtgaatggaaacagatgctctgaatcatcaact 

SgIa^a^ 

atagacaggtccactggaaaggagtttgccctaaagattatagacaaagccaaatgttgt 
gga^ggaIcacctgattgagaatgaagtgtcaatactgcgccgagtgaaacatcccaat 

GTCAAAGGTGGAGATCTCTTTGATGCAATTACTTCGTCGACCAAGTACACTGAGAGAGAT 

ggc^gccatggtgtacaacttagccaatgccctcaggtatctccat 
g?gcacagLacatcaaaccagagaatctcttggtgtgtgaatatc 
tctttgaa^ctgggagactttgggcttgcgactgtggtagaaggccctttatacacact 
^tggca^cccacttatgtggctccaraaatcattgctgaaactggctatggcctgaag 

SggacaStgggSgctggtgtgatc^ 
cgSgtg!gaI2S?ctccaggaagatctcttcgaccaga 



us/115 
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FIGURE 2R 

Stagccggc^gctgccctcaca^gttggtotttcc^ ^ctgtgt 

^TI(^TIGGCATGTG^GGG^CT«TTCGSC^^ TCCAGGC GTCRAGGAGCTC 

TGAGATGTCAGTGACAACGATGGCCT^m^ 
AGTAAATGAATCAAAAGGAAAAAACATTGAA^CAG^ 

GCTGTCCAGGCCTGAGCCAATGCAACCCTGGGCGGGAft ^ctccCAGGAGCAG 

agIagccagccagccacccaggc^^ 

TTCTCATGGTACATGGTTGTCTGAMGCCTTACCTA CCCRflCATCC CCCACACTCC 

atcaagcctgagaatctgctggtgtgcgaataccc ggtctgtggcacgcca 

TGGCAGTTTTCCGTGTT^CCAC^CCC^CTCT^^^^^^^^^^^^Q^^-f 
TGCCTTTTCCAGCTATCCCCA^CTCCAATAAACTCTTCCCTCTTCAG 

c:fo ID NO: 2 6 AA3 83293_H ^^„ rrrrACCC ATTCTTCCCAGGCTCCCAG 



PCT/US00/14842 

WO 00/73469 

FIGURE 2S 

cac^ctcccccctt^c™ 

TCAGAAACAAGCAGAGACGGGCA^^ 

TTTGAGAAGCTCCGCAGGACCCGAGGAGAAGAGAAGGAGG^AGA 

GACATGGAAATCTACCTGATCCTGGAGTACGTGCAG^ 

ATAGAAAGTGTGAAGTTCCCGGAGCCCGATGCT^ 

GCCCTCGTCCACATGCACGACAAGAGCATTGTCCA^ 

TTGGTTCAGCGAAATGAGGACAAATCTACTACCT 

AAGCATGTGGTGAGACCTATATTTACTGTCT 

ATTCTTTCTGAGAAAGGTTATGGACTGGA^ 

TATATCCTGCTGTGTGGCTTTCCCCCATTCCGCAGCCL TGG GACAATATC 
CTCTTTAACATCATCCAGCTGGGCC^ACTTTGAG^TTCCTCC 

TCTGATGCTGCTAAAGATCTGGT^ 

ACAGCTCATCAGGTTCTTCAGCACCCCTGGAT^ 

AAACGACAGAAGCAGGTGTCCCCCAGCAGCGATGOTCACT^ 

GTTGTGGAGCAGGTATCATAGTCACCAC^ 
AAGGACAGAGAAAAGGATAGAAGTTTGAGAG^AAAAA 

ATTGGTGAATCAGAGGGAGAGACACTGAGTATAT^ 

caatgttaaatgtcacaacata™ 
ctgtgagattaataaggtgcattg 

SEQ ID NO: 2 8 -^i^^^CCGCCGCCGCCGCCAGCGACCCCCGCCCCGCCGGCA 
ATGCCAACCGCGCCGGTCCTGCGCCCGCCGCC^^ 

CCCAGTCGCCCTGCGCCTCCCATTCCGGGCCACCG^ 

TGCTTAAGCTCGAAGATCTCTGAGAGAAAGCTGC^ 

GGACCTCTGGAGAAGCCAGTTCTGGGGCCACGTGGTG^ 

CAGAGCAGCCTCCACTCAGTCCGCGCAGAG^ 

GTGGTGAAGCTGGGTGGGCAGCCCCTCCGTAAGGCC^ 

GTGCAGACCTTTGAGCAGCTCCTATCAGACATC^ 

AAGAACGACCGTGTGCGGAAGCTGTTC^ 
GACTTCTTCCGGGAGGGTGATGCTTTCATAGCT 

CACAGTAGAGTCCCCTC^ 

Up 



PCT/US00/14842 



WO 00/73469 



FIGURE 2T 

AGTCACCGCTGTGGGGAGGCAGGAAGCTATAGCGCGGA^TGM^GT^GGCAGTCTCT 
AGGCATCAGGGCAAGACTTCCACAGTGCTCGCCCCAGAAGACAAGGC^ 
TGGGTAAGAGGGAAACAGGAGTraGAACCTGGTGGCCCGCCTT ^GftCCTCC 
CAGGAGGAGACTCATGCAA(HX3(»GAGAAACATCTGG^GGAGAT CRG 

GAGGAGGGGCCGATAGACATGAGGAGAGAGG^ 

GCCTGGCTCCGGAGAGAGCAGCAGGCC^ 

GAGAAGCAAGCAGAGCACGAGAAGAAGCCAGGCGGC^ 

AAGGAGTCTAAGAGGAAGCTAGAAGAGAAGAGGCCAGAAC^ 

ACGGAGGCGGAGATCTACCTGATCATGGAOT 

ATCGTTGAAAATGTGAAGTTTCCAGAGCCCGAGGCTGCAG 

AAGGCCTTCGTCCACATGCACGACAAGAATATCG^TCCA 

CTCCTGGTTCAGCGAAATGAAGACAAGTCTATCACC^ 

GCCAAATATGTGGTGAGGCCTATATTTACTGTGTGTGGG^ 

GAAATTCTTTCTGAGAAAGGTTACGGCCTGGAGGTG^ACA 

CTATACATCCTCTTGTGTGGCTTCCCCCCTTTCCGAAGTC 

GTCTTGAACATCATGCAAGTGGG^ 

TCTGATGCTGCCAAAGATCTGGTGAGAAATTTGC TACCAAC ACAGGG 

S^gaSgS^ 

GTTGCAGAGCAGATGCCATAA 

CTACTAACTACAACTCCTCAAATTCC^AATAAAAAT^ 

CTTACATCTAAAGAGCTAGGGAG^GGA^TTX^ GAAGAGGACAGGATTGT 

GACATTAAAATAGTAGATTTTGGAATGTCTCGAA^ 
GAAATCATGGGAACACCAGAATATTTAGCTCCAGAAATCCTG 

ACAGCAACAGATATGTGGAATATTGG™^ 



WO 00/73469 
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FIGURE 2U 

rTTCGTTTCGATGACTCATTACCCAATCCCCR^C^TOTCRGATT 

CACTTTTTTCmGAGTCATTTGG^ 
TATGATTTGTAGCTTCATATATGACATGTTTATATTGTAA^ 

GTGAAAG 

SEQ ID NO: 30_W44160 M ^gtcggtcaGCGTCTGTGCG 
CCAGACGCGGCTGCACTTTTC^CCT^ 

GTCGCCGCCGGGAGTCGCCTCACAGGGGCCTGGCTGAC^ C ATGTCTCGGA 

GAGTGCGAGGTAAAAGTCTGCCTAGAGAA^ 

GGAGATTCGATTGCCGAAGTGTCTCAGGCTTGCTAAC 

AAACAGAGAATTTTAATAATTTCTATACT^ 
TTGCTGTGGTTAGACAATGTATATCA^TCAACTGG^^ 
TGAAAAAGAGGAGAAGAGGGCAGGATTGCCGGG^ 
TGGAGCTGGCCAGGTCTTGTCCCCACGTGAT^ 

CGGAAATCATTTTGGTGTTAGAATATGCT^ 
CTGAGTTAGCCGAAATGGTATCTGA^TGATGT^ 

AAGGAGTTCATTATCTACATCA^^ 

TACTTTTGAGCAGTATATACCCACTCGGGG^ 

GAAAAATTGGGAATGCAAGTGAGCTTCGGGA^TC^ 

CAGAAATCCTCAACTATGATCCCATTACCACA^ 
TAGCGTATATGTTGTTAACTCATACATCACC^TTTG 

ATCTGAATATTTCTCAAGTGAATGTA^^^ 
AGCTGGCCACAGACTTCATCCAGAGCCTTCTAG^ 

CAC^TCCTGCCTATCCCACTCATC^CTGCAGCAGTGU^ T GAAGAGA 

CTGAGGAAACTTCAGGCTCCrcTCAA^TTCAGGATC^ 

AGACCTCCAAGTCCTCCTGTAATGGGAGCTGTGGAG^ 

TATCAGATCTAATAAGATCTGGAAGATGGATATGXA^™ 
ATGATTAGAGTAGAGGAGAATTGGATAGTACAGAATATGCT 

TTCATAAAGGGAAATGTTAAGTTCTGGCAGCTGACT 
CAGGATAGAAGCCCATCATTAGAGCATAGGCACT 



PCTAJS00/14842 

WO 00/73469 

FIGURE 2V 

GAGGGGCCTAGGGACTGCTTTGGAGATTTCCCACTC 

atctcatgttaagataacagatctca^ 
acaSSaaaagcctgtgtattgaagtggg^^ 

S?agItaaaatacagcctttaaacaacttc 

SEQ ID NO: 31_H01248_H, DRAK1 H c CCCAGGCG CCACCTCAGGCTCG 

atgatccctttggagaagccaggcagcggcggct^ 

SrPGGGCAGGCCGGGGTCTGAGCGGGCCGTGCCG^^ 

gggSStgacagagatacgcgccgtgg^^ 
SgScccgggccgggagctgggcaggggg^ 

^^caacagatat'gtggagcattggag^ 
cgcgccggggtcccgaggcgcccccgctg^ 



rarCATCGGCAAGGUCAA^ - ^ - r , r , af , CTGGATG AAGAAAAL.i — - 

cSStgctatcaagatcatogat^gagccag^ 

mCCG(3GAAGrrCAAATTATGAAGAMCTTTGCC^ TCCTAGTGGAG GGGAAAT 



WO 00/73469 
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FIGURE 2W 

TGCCTATGATCACTATAGTGCAATCTACAGCCTGCTGTGTGATCGACATAAGAGACATAA 

AACCCTGCGTCTCGGAGCACTTCCTAGCATGCCCCGAGCCCTGGCCTTTCAAGCACCAGT 

CAATATCCAGGCGGAGCAGGCAGGTACTGCTATGAACATCAGCGTTCCCCAGGTGCAGCT 

GATCAACCCAGAGAACCAAATTGTGGAGCCGGATGGGACACTGAATTTGGACAGTGATGA 

GGGTGAAGAGCCTTCCCCTGAAGCATTGGTGCGCTATTTGTCAATGAGGAGGCACACAGT 

GGGTGTGGCTGACCCACGCACGGAAGTTATGGAAGATCTGCAGAAGCTCCTACCTGGCTT 

TCCTGGAGTCAACCCCCAGGCTCCATTCCTGCAGGTGGCCCCTAATGTGAACTTCATGCA 

CAACCTGTTGCCTATGCAAAACTTGCAACCAACCGGGCAACTTGAGTACAAGGAGCAGTC 

TCTCCTACAGCCGCCCACGCTACAGCTGTTGAATGGAATGGGCCCCCTTGGCCGGAGGGC 

ATCAGATGGAGGAGCCAACATCCAACTGCATGCCCAGCAGCTGCTGAAGCGCCCACGGGG 

ACCCTCTCCGCTTGTCACCATGACACCAGCAGTGCCAGCAGTTACCCCTGTGGACGAGGA 

GAGCTCAGACGGGGAGCCAGACCAGGAAGCTGTGCAGAGGTACTTGGCAAATAGGTCCAA 

AAGACATACACTGGCCATGACCAACCCTACAGCTGAGATCCCACCGGACCTACAACGGCA 

GCTAGGACAGCAGCCTTTCCGTTCCCGGGTCTGGCCTCCTCACCTGGTACCTGATCAGCA 

TCGCTCTACCTACAAGGACTCCAACACTCTGCACCTCCCTACGGAGCGTTTCTCCCCTGT 

GCGCCGGTTCTCAGATGGGGCTGCGAGCATCCAGGCCTTCAAAGCTCACCTGGAAAAAAT 

GGGCAACAACAGCAGCATCAAACAGCTGCAGCAGGAGTGTGAGCAGCTGCAGAAGATGTA 

CGGGGGGCAGATTGATGAAAGAACCCTGGAGAAGACCCAGCAGCAGCATATGTTATACCA 

GCAGGAGCAGCACCATCAAATTCTCCAGCAACAAATTCAAGACTCTATCTGTCCTCCTCA 

GCCATCTCCACCTCTTCAGGCTGCATGTGAAAATCAGCCAGCCCTCCTTACCCATCAGCT 

CCAGAGGTTAAGGATTCAGCCTTCAAGCCCACCCCCCAACCACCCCAACAACCATCTCTT 

CAGGCAGCCCAGTAATAGTCCTCCCCCCATGAGCAGTGCCATGATCCAGCCTCACGGGGC 

TGCATCTTCTTCCCAGTTTCAAGGCTTACCTTCCCGCAGTGCAATCTTTCAGCAGCAACC 

TGAGAACTGTTCCTCTCCTCCCAACGTGGCACTAACCTGCTTGGGTATGCAGCAGCCTGC 

TCAGTCACAGCAGGTCACCATCCAAGTCCAAGAGCCTGTTGACATGCTCAGCAACATGCC 

AGGCACAGCTGCAGGCTCCAGTGGGCGCGGCATCTCCATCAGCCCCAGTGCTGGTCAGAT 

GCAGATGCAGCACCGTACCAACCTGATGGCCACCCTCAGCTATGGGCACCGTCCCTTGTC 

CAAGCAGCTGAGTGCTGACAGTGCAGAGGCTCACAGCTTGAACGTGAATCGGTTCTCCCC 

TGCTAACTACGACCAGGCGCATTTACACCCCCATCTGTTTTCGGACCAGTCCCGGGGTTC 

CCCCAGCAGCTACAGCCCTTCAACAGGAGTGGGGTTCTCTCCAACCCAAGCCCTGAAAGT 

CCCTCCACTTGACCAATTCCCCACCTTCCCTCCCAGTGCACATCAGCAGCCGCCACACTA 

TACCACGTCGGCACTACAGCAGGCCCTGCTGTCTCCCACGCCGCCAGACTATACAAGACA 

CCAGCAGGTACCCCACATCCTTCAAGGACTGCTTTCTCCCCGGCATTCGCTCACCGGCCA 

CTCGGACATCCGGCTGCCCCCAACAGAGTTTGCACAGCTCATTAAAAGGCAGCAGCAACA 

ACGGCAGCAGCAGCAGCAACAGCAGCAACAGCAAGAATACCAGGAACTGTTCAGGCACAT 

GAACCAAGGGGATGCGGGGAGTCTGGCTCCCAGCCTTGGGGGACAGAGCATGACAGAGCG 

CCAGGCTTTATCTTATCAAAATGCTGACTCTTATCACCATCACACCAGCCCCCAGCATCT 

GCTACAAATCAGGGCACAAGAATGTGTCTCACAGGCTTCCTCACCCACCCCGCCCCACGG 

GTATGCTCACCAGCCGGCACTGATGCATTCAGAGAGCATGGAGGAGGACTGCTCGTGTGA 

GGGGGCCAAGGATGGCTTCCAAGACAGTAAGAGTTCAAGTACATTGACCAAAGGTTGCCA 

TGACAGCCCTCTGCTCTTGAGTACCGGTGGACCTGGGGACCCTGAATCTTTGCTAGGAAC 

TGTGAGTCATGCCCAAGAATTGGGGATACATCCCTATGGTCATCAGCCAACTGCTGCATT 

CAGTAAAAATAAGGTGCCCAGCAGAGAGCCTGTCATAGGGAACTGCATGGATAGAAGTTC 

TCCAGGACAAGCAGTGGAGCTGCCGGATCACAATGGGCTCGGGTACCCAGCACGCCCCTC 

CGTCCATGAGCACCACAGGCCCCGGGCCCTCCAGAGACACCACACGATCCAGAACAGCGA 

CGATGCTTATGTACAGCTGGATAACTTGCCAGGAATGAGTCTCGTGGCTGGGAAAGCACT 

TAGCTCTGCCCGGATGTCGGATGCAGTTCTCAGTCAGTCTTCGCTCATGGGCAGCCAGCA 

GTTTCAGGATGGGGAAAATGAGGAATGTGGGGCAAGCCTGGGAGGTCATGAGCACCCAGA 

CCTGAGTGATGGCAGCCAGCATTTAAACTCCTCTTGCTATCCATCTACGTGTATTACAGA 

CATTCTGCTCAGCTACAAGCACCCCGAAGTCTCCTTCAGCATGGAGCAGGCAGGCGTGTA 



WO 00/73469 
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FIGURE 2X 

ACAAGAAACAGAGAGTTTTGTGTACAGCTTGGGAATGAAAAGGTTGATTGTAAACCCACA 
GTATCTAGCAGCGTTGTGCCAAATTGCCCTTGTGTTTCTCTCCACCCAAAATATCACAGC 
TGCTTTCCTCACATTTGGTTCATCCGTGTGCTGTTCTTTTGGGTTCTGAGAGGGTTTTGC 
CATGTTTGCTTGTATGACCAAGTCACCAAGGAAATAAACAGGAAGGAAATCCATGTTCTC 



SEQ ID NO: 33_2R22-5-ll_H 

CTGGGCCGCTGCCGGTCAGGTCGGCCGCCCCTGACAGCTCCGGGAGCCTCAAGCGCGACA 

GGGCGCCCTCACCTCGGGACATCCACACACCGACCGCTCCTGCTCCAGAGGCAACAACCC 

AGCGCGCCTAGCCTGGCGCCGTGCAGCGAAGCCCAAGAGCTGGCCTCGCCACGAAGGTTG 

AACCAGCCAAATTTTCGAGACAGCTCACGGCTTAGAGGAAGGTTCATCTAAATAAAGGCC 

GGCTAAAGTGACATTGCAGGGATTAAATCCTTCTTTGGCTGCCTGTGTGACCAGAAGGCT 

TATTTGCAAGTTTCTTCTTTCCTGGGGTCCAGATTATTAGGTCTCCAGCGCCCTGCAGCT 

TGACAGAAAGAGAAGCATGAAATGAAGGTCAGAGATGAGATCCCGCAGCAGGGACGTGGG 

GGCCTCCCAGGGGCATTTACGCACCAGAGTGCAAGATTCTCTGGCCATCAAGGGAAATAG 

CAAACAGAAGCCTTTGTCCTGGGGCACAGCCACCTACCACAAAGCATCAGACTCCACGTC 

TGGCCAGAAAGTTCCTGGAGTCCCATCAGGCCAGTGGGTATGTAACATGTGCCTAATTGT 

ACAGCTAGAGCCTGCAAGTTCAACGTGAGGGAAGGTGGGAAATGTCTTGAGTGAGGCGAG 

CAGCTCCTGGCTGGGCTGGGCAGACTCAGCTACCACGTTCACTGCCTTCCTCTCACTAAA 

GCCGAGAGGGAGGCTGCTCAGCTCTCAGGAAAACTCTTTTGAACCCTGGGCACCTGCTGT 

CCTCAGTTGGCATCTCCCACCCTCTGAGCCTCTTCTGCTCCTGCACAACCTGCCTCTTCG 

CTGAGATGGAGACGTGAGCCCCCGTGGACGATGACTGCAGTGTATATGAATGGAGGTGGC 

CTGGTGAACCCCCACTATGCCCGGTGGGATCGGCGCGACAGTGTAGAAAGTGGCTGTCAG 

ACCGAGAGTAGCAAGGAGGGTGAGGAGGGACAGCCCCGCCAGCTGACGCCCTTCGAGAAA 

CTGACACAGGACATGTCCCAGGATGAGAAGGTGGTGAGGGAGATCACGCTGGGGAAACGG 

ATAGGCTTCTACCGAATTCGAGGGGAAATCGGAAGTGGAAACTTCTCCCAAGTGAAGCTT 

GGGATTCACTCCCTAACCAAAGAAAAGGTGGCCATTAAGATCCTGGACAAGACCAAGTTA 

GACCAGAAAACCCAGAGGCTACTATCCCGAGAAATCTCCAGCATGGAAAAGCTGCACCAT 

CCCAACATCATCCGGCTTTACGAAGTGGTGGAGACCCTATCCAAGCTGCACTTGGTGATG 

GAGTATGCAGGGGGTGGGGAGCTCTTCGGAAAAATTAGCACTGAGGGGAAGCTCTCTGAA 

CCAGAAAGCAAGCTCATCTTCTCCCAGATTGTGTCTGCCGTGAAGCACATGCATGAAAAC 

CAAATTATTCATAGAGATCTGAAAGCAGAAAATGTATTCTATACCAGTAATACTTGTGTG 

AAGGTGGGCGATTTTGGATTCAGCACAGTAAGCAAAAAAGGTGAAATGCTGAACACTTTC 

TGTGGGTGTCCTCCCTACGCTGCGCCTGAACTCTTCCGGGACGAGCACTACATCGGCATT 

TACGTGGATATCTGGGCCTTGGGGGTGCTTTTGTACTTCATGGTGACTGGCACCATGCCA 

TTTCGGGCAGAAACCGTGGCCAAACTAAAAAAGAGCATCCTCGAGGGCACATACAGTGTA 

CCGCCGCACGTGTCAGAGCCCTGCCACCGACTCATCCGAGGAGTCCTTCAGCAGATCCCC 

ACGGAGAGGTACGGAATCGACTGCATCATGAATGATGAATGGATGCAAGGGGTGCCATAC 

CCTACACCTTTGGAACCTTTCCAACTGGATCCCAAACATTTGTCGGAAACCAGCACTCTC 

AAGGAAGAAGAAAATGAGGTCAAAAGCACTTTAGAACATTTGGGCATTACAGAAGAGCAT 

ATTCGAAATAACCAAGGGAGAGATGCTCGCAGCTCAATCACAGGGGTCTATAGAATTATT 

TTACATAGAGTCCAAAGGAAGAAGGCTTTGGAAAGTGTCCCAGTCATGATGCTACCAGAC 

CCTAAAGAAAGAGACCTCAAAAAAGGGTCCCGTGTCTACAGAGGGATAAGACACACATCC 

AAATTTTGCTCGATTTTATAAATTGCACTAGACTGCTTGTAACTAACCAAGATGATTGTT 

GCTGCTTCTAAATTTTTTTCAAGGACAACTTGAGTGGAGACATTTTTGTAATTTTTAAAT 

AAACTTAAATTTGAGATATGCAAAAAAAAAA 
SEQ ID NO: 34_R31237_1_H, AAC33487 

ATGTCCACTAGGACCCCATTGCCAACGGTGAATGAACGAGACACTGAAAACCACACGTCA 
CATGGAGATGGGCGTCAAGAAGTTACCTCTCGTACCAGCCGCTCAGGAGCTCGGTGTAGA 
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FIGURE 2Y 

AACTCTATAGCCTCCTGTGCAGATGAACAACCTCACATCGGAAACTACAGACTGTTGAAA 

ACAATCGGCAAGGGGAATTTTGCAAAAGTAAAATTGGCAAGACATATCCTTACAGGCAGA 

GAGGTTGCAATAAAAATAATTGACAAAACTCAGTTGAATCCAACAAGTCTACAAAAGCTC 

TTCAGAGAAGTAAGAATAATGAAGATTTTAAATCATCCCAATATAGTGAAGTTATTCGAA 

GTCATTGAAACTGAAAAAACACTCTACCTAATCATGGAATATGCAAGTGGAGGTGAAGTA 

TTTGACTATTTGGTTGCACATGGCAGGATGAAGGAAAAAGAAGCAAGATCTAAATTTAGA 

CAGATTGTGTCTGCAGTTCAATACTGCCATCAGAAACGGATCGTACATCGAGACCTCAAG 

GCTGAAAATCTATTGTTAGATGCCGATATGAACATTAAAATAGCAGATTTCGGTTTTAGC 

AATGAATTTACTGTTGGCGGTAAACTCGACACGTTTTGTGGCAGTCCTCCATACGCAGCA 

CCTGAGCTCTTCCAGGGCAAGAAATATGACGGGCCAGAAGTGGATGTGTGGAGTCTGGGG 

GTCATTTTATACACACTAGTCAGTGGCTCACTTCCCTTTGATGGGCAAAACCTAAAGGAA 

CTGAGAGAGAGAGTATTAAGAGGGAAATACAGAATTCCCTTCTACATGTCTACAGACTGT 

GAAAACCTTCTCAAACGTTTCCTGGTGCTAAATCCAATTAAACGCGGCACTCTAGAGCAA 

ATCATGAAGGACAGGTGGATCAATGCAGGGCATGAAGAAGATGAACTCAAACCATTTGTT 

GAACCAGAGCTAGACATCTCAGACCAAAAAAGAATAGATATTATGGTGGGAATGGGATAT 

TCACAAGAAGAAATTCAAGAATCTCTTAGTAAGATGAAATACGATGAAATCACAGCTACA 

TATTTGTTATTGGGGAGAAAATCTTCAGAGCTGGATGCTAGTGATTCCAGTTCTAGCAGC 

AATCTTTCACTTGCTAAGGTTAGGCCGAGCAGTGATCTCAACAACAGTACTGGCCAGTCT 

CCTCACCACAAAGTGCAGAGAAGTGTTTCTTCAAGCCAAAAGCAAAGACGCTACAGTGAC 

CATGCTGGACCAGCTATTCCTTCTGTTGTGGCGTATCCGAAAAGGAGTCAGACAAGCACT 

GCAGATGGTGACCTCAAAGAAGATGGAATTTCCTCCCGGAAATCAAGTGGCAGTGCTGTT 

GGAGGAAAGGGAATTGCTCCAGCCAGTCCCATGCTTGGGAATGCAAGTAATCCTAATAAG 

GCGGATATTCCTGAACGCAAGAAAAGCTCCACTGTCCCTAGTAGTAACACAGCATCTGGT 

GGAATGACACGACGAAATACTTATGTTTGCAGTGAGAGAACTACAGCTGATAGACACTCA 

GTGATTCAGAATGGCAAAGAAAACAGCACTATTCCTGATCAGAGAACTCCAGTTGCTTCA 

ACACACAGTATCAGTAGTGCAGCCACCCCAGATCGAATCCGCTTCCCAAGAGGCACTGCC 

AGTCGTAGCACTTTCCACGGCCAGCCCCGGGAACGGCGAACCGCAACATATAATGGCCCT 

CCTGCCTCTCCCAGCCTGTCCCATGAAGCCACACCATTGTCCCAGACTCGAAGCCGAGGC 

TCCACTAATCTCTTTAGTAAATTAACTTCAAAACTCACAAGGAGTCGCAATGTATCTGCT 

GAGCAAAAAGATGAAAACAAAGAAGCAAAGCCTCGATCCCTACGCTTCACCTGGAGCATG 

AAAACCACTAGTTCAATGGATCCGGGGGACATGATGCGGGAAATCCGCAAAGTGTTGGAC 

GCCAATAACTGCGACTATGAGCAGAGGGAGCGCTTCTTGCTCTTCTGCGTCCACGGAGAT 

GGGCACGCGGAGAACCTCGTGCAGTGGGAAATGGAAGTGTGCAAGCTGCCAAGACTGTCT 

CTGAACGGGGTCCGGTTTAAGCGGATATCGGGGACATCCATAGCCTTCAAAAATATTGCT 

TCCAAAATTGCCAATGAGCTAAAGCTGTAA 
SEQ ID NO: 35 W90839_M 

AAAGGGCCGTCCTGGTCCAGCCGTTCCCTGGGTGCCCGTTGCCGGAACTCTATCGCTTCC 
TGCCCTGAGGAACAACCCCATGTGGGCAACTATAGGCTGCTAAGGACCATCGGGAAGGGC 
AACTTCGCCAAAGTCAAGCTGGCTCGGCATATCCTCACGGGCCGGGAGGTCGCTATTAAG 
ATCATTGATAAGACCCAGCTGAACCCCAGTAGCTTGCAGAAGCTGTTCAGAGAAGTCCGA 
ATTATGAAGGGACTCAACCACCCCAACATCGTGAAGCTTTTTGAGGTGATAGAGACGGAG 
AAGACGCTATACCTGGTGATGGAATACGCTAGCGCAGGAGAAGTGTTTGACTACCTCGTG 
TCGCACGGCCGCATGAAGGAGAAGGAGGCTCGAGCCAAGTTCCGGCAGATCGTGTCAGCC 
GTGCACTACTGTCATCAGAAGAACATTGTACACAGGGATCTAAAGGCTGAAAACCTGTTG 
CTGGATGCCGAGGCCAACATCAAAATCGCCGACTTCGGCTTCAGCAATGAGTTCACGCTG 
GGCTCCAAGCTGGACACCTTCTGTGGGAGCCCCCCATACGCCGCCCCAGAGCTGTTCCAG 
GGCAAGAAGTATGATGGGCCAGAGGTGGACATCTGGAGCCTGGGTGTCATCCTGTACACG 
CTGGTCAGCGGCTCCCTGCCCTTCGATGGGCACAACCTCAAGGAGCTGCGGGAGCGAGTC 
CTCAGAGGAAAGTACCGGGTCCCCTTCTACATGTCTACAGACTGCGAGAGCATTCTGCGG 
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FIGURE 2Z 

AGATTTCTGGTGCTGAACCCCGCAAAACGCTGTACTCTGGAGCAAATCATGAAAGACAAA 

aS^?gtg?gcacagagcgaccaggatctgaacgcccgtccttgttg 
Saaaatag?tccggtacctcgcgggtgccccctgcctcgccttccagtcatagcctgg^ 
?c^g?Sggcgagcggagccgcctggctcggggctccaccatccgcagcaccttccat 
gggggccaggtccgagaccggcgggcagggagcgggagtggcgggggtgtgcagaatgga 

cccccagcc?cacc^^ 

cccaccaccaacctcttcaccaagctgacctccaaactgacccgaagggtcacagacgaa 
cSgagagaatcgggggacctgaggtcacaagttgccatctaccttgggataaaa^ 
Sgcccccaggctgctccgattcccctggagtgtgaagctgaccagctcgcgaccttcc 

tgaggccctgatggctgccctgcgacaggccaca 

crn TD NO- 36 406786.5 H 

gtagccggcttggcgtgaccgtcgcctgatccagttgttagaggtggaagcttggcagtt 
ggc?tocttcttcccatggaggtcgggggcttaacagtctttgaagaggaccagagatg 

rrTTTCCCAGAGCCTCCCCTTGCCAGTGTCAGCAGAGGGCCCAGCTGCACAGACCACTGC 

?gagcccagcaggtcgttttcctcagcccacagacacctgagcagaaggaatgggctttc 
SSactctcccagagcaggacggcgctctctgaagacagatggagctcctattgtct 
ScactLctgcccagaatatttgtacaagtaaactgcactgccctgctgcccctgagca 
cIcgg^ccgtccgaaccgcggggcagtgtgtcctgctgctccctgctgcggggactgt^ 

ScAGGGTGGTCCTCACCTCTGCTTCCGGCCCCTGTGTGCAACCCTAACAAGGCCATCTT 

ScGGTGGATGCCAAGACCACAGAGATCCTCGTTGCTAACGACAA^^ 
GGGGTACAGCAGCCAGGACCTGATTGGCCAGAAGCTCACGCAGTTCTTTCTGAGGTCAGA 

TTCTGATCTGGTGGAGGCCCTCAGCGAGGAGCACATGGAGGCCGACGGCCACGCTGCGGT 

GGTGTTTGGCACGGTGGTGGACATCATCACCCGTAGTGGGGAGAAGATTCCAGTGTCTGT 

gSgSgaS^ 

GGAGAGGOTCTCGACCTGGGTCGCTTTCCAGAGCGATGGCACCATCACGTCATGTGACAG 
TCTCTTTGCTCATCTTCACGGGTACGTGTCTGGGGAGGACGTGGCTGGGCAGCATATCAC 
AGACCTGATCCCTTCTGTGCAGCTCCCTCCTTCTGGCCAGCACATCCCAAAGAATCTCAA 

ga??cIgaggtctgttggaagagccagggacggtaccaccttccctctgagctta^^ 

GAAATCCCAACCCAGCAGCGAGGAGGCGACCACCGGTGAGGCGGCCCCTGTGAGCGGCTA 

gacStSacLcatcaaccacagcttcgcgctgacactgtttggttacggaaagacgg^ 

cctotgggcaIgaatatcactttcctgattcctggtttctacagcta 

gScagctca^cagctcccagacctggccag^ 

?gggtctggggagagaaccttggacccgtggcagggccaggacccagctgaggggggcca 
ggScSggattaatgtcgtgcttgctggtggccacgttg 

rAAGC^GGAAAGCCAAGACATCTTCACCGGGACTCAGACTGAGCTGATTGCTGGAGG 

SgcSgcSgtgc^ggtgaacaggcgctgcccaaggaccagcaa^^ 

GAGAGAGGAACCTGTGGCAAT 
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FIGURE 2AA 

GTCCTGCGATCTCMGAGGCAGAGMCTCTCCGGTGG^ ^cccrggagGTTGA 
CTATGCCTTGGCCACGGACCTCCCTGGGGGCCTGGAfiG GACAGACCAAAC 

tgLgacacgtgcccatcagcagaggagcc^ggctg^c^ 

CGTGATCGTGATGCGCGGGGCTGCTGGC^TG^ 
CGGGAGCTGCTACCATCGAGATGGCTTACGGCTGA^ 

GGAGCTCCAGGGCCCCACACCTCTGTTCTGCl^ CCAGCCTGC CCGGCTCCACCCA 
CCAACGCGACTCAGCCGCCAGGACCCGCCT 

CTCTACCGCTGCTGAGCTCACCGGACCCAGCC1 CGGCC TGTGAGGGCGAGTA 
GTTTGAGGAGCCCCCCAAGGCTOT^ 
CTCCCAAAAGTACAGTACCATGAGCCCG^ 
TGCTGTGGACAAGGGAAAAAACAAGGAGGTGG^^ 

CTTGGAGGATTGTTGGATTGAGGATCCCAAACT ATATATTTGAAAAC CAAGG 
TCTATCCAGGGTGGAGCACGCCAATATCATCAA^ 

GTTCTTCCAGCTTGTGA^ 
CCGCCACCCCAGGCTGGATGAGCCCCTGGCGA^ 

CCAGAGCCGTCTAGTGTCAGCAGTGGGATACCTGCb GCTGATAGAC TTTGG 
AAACAAGCCAGAAAGTGGAGTTCTGTCCGCTGCGA^CC ^ 

taGTGATGTGGCCCAGGCTCAGGAGCTTTGTGGG ACCRGCXMAC ACCAATrTC 

Sc^CG^S^ 
TGTCTGCTGTAGGATACCTGCACTCCCAAGGCATCATC^ 

AGAGAGAGGGTCTCTCTGGGCCTCCTGCTCCCAGGG^ TCCTGCTCTT TGGTTTG 

SS?ggaS 
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FIGURE 2BB 

CAGATCTGCCTGCGCTTGCCTCCC^™ 
GTTCTTCCTGGTTTTGTGTGTAGGTTTCTTTCC^ 

agaXcttggttttctatttctttttgt 

GGCACGAGGCGCGCCTGGCTGGGC 

GCAAGCGGAGCGCAGTTCGCCCA^GCCAAG^ 

CTCCTGTCCGCCGTGTCTAGCAGCGGGGCC^ 

CTTGCAGCGCGGGCCGGTCCGGGTGGGGT^ 
CAACTTCGCTGTGGTGAAGCTGGGGCGGCACCGGATC 

aataatcgataagtctcagctggatgcagtgaacot 
Sga^tgttagaccacc^^ 

AAGTATGTTGTACCTTGTGACAC^TATGCCAA^l CCTGTCTGC 
--TCATGGCCGGTTAAATGAG^T^ 



TAA' 



CCTTGTCTGTGGAGCTCTGCCCTTTGATGGACCGACTC^ GAGCA CCTTATCCG 

tSggaaggaagattccggattccgtattt^ 

AAGGATGTTGGTCCTAGACCCATCCAAACGGCTAACC^ 
ATGGATGCTCATAGAAGTTCCTGTCCAGAGACCTGTTCTCT 

Ugccatccatcggggagtttaatg^ 



.CTTGATGGCCGCCAGCGTCGGCCTA^ 

CTGTGGGGCTCCCAGTGACCATGCATTCACC^ 
TCCCCCAGGCATCCAACGTGGAGGCCTTTTCAT^ 
AGCTGCATTCATGGAAGAAGAGTGTGTGGACAC^ 
CCCTGTGCCTCCTGTCCTGGTGCGGAAGGGATC 



AGATCAGCAGAAARCCATTGAGT^^ 
TTATTTCTTGTTGGTGGAGCGCCTGAAATCACATC 

CATGCATTCACCGAACATGAGGC 
GGAGGCCTTTTCATTTCCAGCA' 
GTGTGTGGACACTCCAAAGGTC) 
GCGGAAGGGATGCCAGTCACTG 
rACCTCCATTGACGAAGGGCTGGAGACAGAAGGAGAGGCC^ 

SSgaggcatttcagtccacacgcagcgggc^ 

CAATCAACTGGTCGTGATGCCTGGGGCAGGGAAAATTT^ 

AGGAATTCTAGAGTTGAACA^GTGCAGTTGTTGTO^^ ^^^^^^^^qq^ 
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FIGURE 2CC 

GCAGCAGCTGCAGGAACATAGGCTCCAGCAGAAGCGACTCTTTCTTCAGAAGCAGTCTCA 

aSgSgSctattttaatcagatgcagatagcagagag 
gSgctgccccttccccgccaggagactccaccgccttctcagcaggccccaccgttcag 

?Sgacccagcccctgagccccgtcctggagccttcctccgagcagatgcaatacagc 

SotSg2cagtaccaagagatgcagcttcagcccctgccctccacttcc 

ggctgctcctcctctgcccacgcagctacagcagcagcagccgccaccgccaccaccccc 

tccaccaccacgacagccaggagctgccccagcccccttacagttctcctatcagact 

tSgctgccaagcgctgcttcccctgcgccagactatcccactccctgtcagtatcctgt 

gga?ggagcc^SLagcgacctaac 

agagSccctccagctacgacccactagccctctctgagctacctggactctt 

?ga^tcctagacgctgtggatccacaacacaacgggtatgtcctggtgaattagt^ 

g?a^ggaattgaggtgggtcaggtgaaggaagagtgtatgttcctatttttattcc^ 

cSttaaatttaaagcttattttcttgccctctccctaacggggagaaatcgagccaccc 

aactggaatcagagggtctggctggggtggatgttgcttcctcctggttctgccccacca 

caaagttttctgtggcaagtgctggaacatagttgtaggctgaggcaggagaatggcgtg 

^?XgLggagcttgcagtgaggcaagatcgtgccactgcactccagcctgggcg 

^gagcaagactccacctcaaaaaaaaaaaaaaaggacaagagcagtatcatctgcctc 

^ISg^ggtctgaagcacctgtaatgtcagagcccttgtctggcccttggtgg^ 
gSaacgaaagcagtggagcctctcaccttccagtagcctctcacattcttattttacca 

?^OTC^AA^ 

^™^^ GTCCACATCTT ctccaggcaggtttcaacctatcagcagactcaggcacac 

ac?gggg?acLatagagaaccaggcggcagcagtgctcgcag^ 
?Sgatgggttctgcccagatactctgctcgcccacccacaagggagcaatagcttatat 

S?SctcLtagcatctgactcttttcataagcaaacagctgtataaacaaagcccc^ 
SSgScaaLacagggtgaatgtgatattgttcccacaaccttattctccactcaaca 

gccgotgotttggggaagaggccgcctt^ 

?gcacSaaccaggctgagggagacaaaaaccccgcagacccgcctgcctttcagcgtcc 

IgSStccaSaagtttaggctcacctcaaagatgtctagtttttcc^ 

agSctttcctLagaacacccccttcctcaattgccaaggggccgcatcgcacggca^ 

agSSccactgcaggccagcagattccaccccaggaacggtcatgaactcagcctt 

c??Sgaggggcgtaacatttccttacagtcaagc 
S?™ggattaaaaaagtaataac^gactttgacttaatac 

aarTGGGTGGGTAGAGGGGAGCTTT^ 

?g?Sg?gtc?gtctgcagaagatttgctcagtcaaggaaattcaagtggtgagacct^^ 
caccSggSS^Lgagaaacctg 

rar aaaggtttgcttcacttcggggactgcagtttgagaaataaaagggatacagagata 

toSggSSttggtctgaaagagttacttttgataaagttaatctaa^ 

™?gtgtgcttttttttaattactaagaw 
??Sgag?g?aLtcataatagctccaatgtgaaaaaaaaaa 

IS?S?gttagaaaattgcctaaaatgcagtgtaataa^^ 

A^TTTAAATGGGGTAATTTTCTGCAAGGAAAATGTACTGTTTTTATGTTTCCAACCCTCT 



TGA 



5*1 U) 
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FIGURE 2DD 

SEQ ID NO: 3 9_AA207220_H 

GCTGTGGCTCCCCGTCCTGGTGCGGGACCTGTGCCCCGCGCTTCAGCCCTCCCCGCAAGC 

CTATTGATTCCCCTGCCGCCCTTGCTCCACCTCCTGCTCGCCATGGAGTCGCTGGTTTTC 

GCGCGGCGCTCCGGCCCCACTCCCTCGGCCGCAGAGCTAGCCCGGCCGCTGGCGGAAGGG 

CTGATCAAGTCGCCCAAGCCCCTAATGAAGAAGCAGGCGGTGAAGCGGCACCACCACAAG 

CACAACCTGCGGCACCGCTACGAGTTCCTGGAGACCCTGGGCAAAGGCACCTACGGGAAG 

GTGAAGAAGGCGCGGGAGAGCTCGGGGCGCCTGGTGGCCATCAAGTCAATCCGGAAGGAC 

AAAATCAAAGATGAGCAAGATCTGATGCACATACGGAGGGAGATTGAGATCATGTCATCA 

CTCAACCACCCTCACATCATTGCCATCCATGAAGTGTTTGAGAACAGCAGCAAGATCGTG 

ATCGTCATGGAGTATGCCAGCCGGGGCGACCTTTATGACTACATCAGCGAGCGGCAGCAG 

CTCAGTGAGCGCGAAGCTAGGCATTTCTTCCGGCAGATCGTCTCTGCCGTGCACTATTGC 

CATCAGAACAGAGTTGTCCACCGAGATCTCAAGCTGGAGAACATCCTCTTGGATGCCAAT 

GGGAATATCAAGATTGCTGACTTCGGCCTCTCCAACCTCTACCATCAAGGCAAGTTCCTG 

CAGACATTCTGTGGGAGCCCCCTCTATGCCTCGCCAGAGATTGTCAATGGGAAGCCCTAC 

ACAGGCCCAGAGGTGGACAGCTGGTCCCTGGGTGTTCTCCTCTACATCCTGGTGCATGGC 

ACCATGCCCTTTGATGGGCATGACCATAAGATCCTAGTGAAACAGATCAGCAACGGGGCC 

TACCGGGAGCCACCTAAACCCTCTGATTGCCTGNNTGGCCTGATCCGGTGGCTGTTGATG 

GTGAACCCCACCCGCCGGGCCACCCTGGAGGATGTGGCCAGTCACTGGTGGGTCAACTGG 

GGCTACGCCACCCGAGTGGGAGAGCAGGAGGCTCCGCATGAGGGTGGGCACCCTGGCAGT 

GACTCTGCCCGCGCCTCCATGGCTGACTGGCTCCGGCGTTCCTCCCGCCCCCTCCTGGAG 

AATGGGGCCAAGGTGTGCAGCTTCTTCAAGCAGCATGCACCTGGTGGGGGAAGCACCACC 

CCTGGCCTGGAGCGCCAGCATTCGCTCAAGAAGTCCCGCAAGGAGAATGACATGGCCCAG 

TCTCTCCACAGTGACACGGCTGATGACACTGCCCATCGCCCTGGCAAGAGCAACCTCAAG 

CTGCCAAAGGGCATTCTCAAGAAGAAGGTGTCAGCCTCTGCAGAAGGGGTACAGGAGGAC 

CCTCCGGAGCTCAGCCCAATCCCTGCGAGCCCAGGGCAGGCTGCCCCCCTGCTCCCCAAG 

AAGGGCATTCTCAAGAAGCCCCGACAGCGCGAGTCTGGCTACTACTCCTCTCCCGAGCCC 

AGTGAATCTGGGGAGCTCTTGGACGCAGGCGACGTGTTTGTGAGTGGGGATCCCAAGGAG 

CAGAAGCCTCCGCAAGCTTCAGGGCTGCTCCTCCATCGCAAAGGCATCCTCAAACTCAAT 

GGCAAGTTCTCCCAGACAGCCTTGGAGCTCGCGGCCCCCACCACCTTCGGCTCCCTGGAT 

GAACTCGCCCCACCTCGCCCCCTGGCCCGGGCCAGCCGACCCTCAGGGGCTGTGAGCGAG 

GACAGCATCCTGTCCTCTGAGTCCTTTGACCAGCTGGACTTGCCTGAACGGCTCCCAGAG 

CCCCCACTGCGGGGCTGTGTGTCTGTGGACAACCTCACGGGGCTTGAGGAGCCCCCCTCA 

GAGGGCCCTGGAAGCTGCCTGAGGCGCTGGCGGCAGGATCCTTTGGGGGACAGCTGCTTT 

TCCCTGACAGACTGCCAGGAGGTGACAGCGACCTACCGACAGGCACTGAGGGTCTGCTCA 

AAGCTCACCTGAGTGGAGTAGGCATTGCCCCAGCCCGGTCAGGCTCTCAGATGCAGCTGG 

TTGCACCCCGAGGGGAGATGCCTTCTCCCCCACCTCCCAGGACCTGCATCCCAGCTCAGA 

AGGCTGAGAGGGTTTGCAGTGGAGCCCTGAGCAGGGCTGGATATGGGAAGTAGGCAAATG 

AAATGCGCCAAGGGTTCAGTGTCTGTCTTCAGCCCTGCTGAACGAAGAGGATACTAAAGA 

GAGGGGAACGGGAATGCCCGCGACAGAGTCCACATTGCCTGTTTCTTGTGTACATGGAGG 

GGCCACAGAGA 

SEQ ID NO: 40 AA426580_H, MAK_V_H 

ATGCCGGCGGCGGCGGGGGACGGGCTCCTGGGGGAGCCGGCGGCGCCTGGGGGCGGCGGC 
GGCGCGGAGGACGCGGCCAGGCCCGCGGCGGCCTGCGAGGGAAGTTTCCTGCCTGCCTGG 
GTGAGCGGCGTGCCCCGCGAGCGGCTCCGCGACTTCCAGCACCACAAGCGCGTGGGCAAC 
TACCTCATCGGCAGCAGGAAGCTGGGCGAGGGCTCCTTTGCCAAGGTGCGCGAGGGGCTG 
CACGTGCTGACCGGGGAGAAGGTGGCCATAAAAGTCATTGATAAGAAGAGAGCCAAAAAG 
GACACCTATGTCACCAAAAACCTGCGGCGAGAGGGTCAGATCCAGCAGATGATCCGCCAC 
CCCAATATCACTCAGCTCCTTGATATTTTAGAAACGGAAAACAGCTACTACCTGGTCATG 
GAGCTGTGCCCTGGGGGCAACCTGATGCACAAGATCTATGAGAAGAAGCGGCTGGAGGAG 
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TCCGAAGCCCGCAGATACATCCGACAGCTCATCTCTGCCGTAGAGCACCTGCACCGGGCC 

GGGGTGGTCCACAGAGACTTGAAGATAGAGAATTTGCTACTAGATGAAGACAATAATATC 

AAGCTGATTGACTTTGGTTTGAGCAACTGCGCAGGGATCCTGGGTTACTCGGATCCGTTC 

AGCACACAGTGTGGCAGCCCTGCCTACGCTGCACCTGAACTGCTCGCCAGGAAGAAATAC 

GGCCCCAAAATCGATGTCTGGTCCATAGGTGTGAACATGTATGCCATGTTGACCGGGACG 

CTGCCTTTCACGGTGGAGCCTTTCAGCCTGAGGGCTTTGTACCAGAAGATGGTAGACAAA 

GAAATGAACCCCCTCCCCACTCAGCTCTCCACAGGTGCCATCAGTTTCCTGCGCTCTCTC 

CTGGAACCGGATCCTGTGAAGAGGCCAAATATTCAGCAGGCACTGGCGAATCGCTGGCTT 

AATGAGAATTACACGGGCAAAGTGCCCTGTAATGTCACCTATCCCAACAGGATTTCTCTG 

GAAGATCTGAGCCCGAGCGTCGTGCTGCACATGACCGAGAAGCTGGGTTACAAGAACAGC 

GACGTGATCAACACTGTGCTCTCCAACCGCGCCTGCCACATCCTGGCCATCTACTTCCTC 

TTAAACAAGAAACTGGAGCGCTATTTGTCAGGGAAATCTGACATCCAGGACAGCCTCTGC 

TACAAGACCCGGCTCTACCAGATAGAAAAGTACAGGGCCCCCAAGGAGTCCTATGAGGCC 

TCTCTGGACACCTGGACACGAGATCTTGAATTCCATGCCGTGCAGGATAAAAAGCCCAAA 

GAACAAGAAAAAAGAGGGGATTTTCTTCATCGACCATTCTCCAAGAAGTTGGACAAGAAC 

CTGCCCTCGCACAAACAGCCCTCAGGCTCGCTTATGACACAGATTCAGAACACCAAAGCC 

CTCCTGAAGGACCGGAAGGCCTCCAAGTCCAGCTTCCCCGACAAAGATTCCTTTGGCTGC 

CGCAATATTTTCCGCAAAACCTCAGATTCCAATTGTGTGGCTTCTTCTTCCATGGAGTTC 

ATCCCCGTGCCACCGCCCAGGACCCCGAGGATTGTGAAGAAACCGGAGCCCCATCAGCCA 

GGGCCCGGAAGCACTGGCATCCCCCACAAGGAAGACCCCCTGATGCTGGACATGGTGCGC 

TCCTTCGAGTCTGTGGATCGCGACGACCACGTAGAAGTGCTGTCTCCCTCTCATCACTAC 

AGGATTCTGAACTCCCCGGTCAGCTTGGCTCGCAGAAATTCCAGCGAGAGGACGCTGTCC 

CCGGGTCTGCCATCCGGAAGCATGTCGCCTCTCCATACTCCTTTGCATCCAACTCTGGTC 

TCTTTTGCTCACGAAGATAAGAACAGCCCCCCAAAAGAGGAGGGCCTGTGTTGCCCACCT 

CCGGTTCCCAGCAATGGCCCCATGCAGCCTCTGGGGAGCCCCAATTGTGTGAAAAGCCGA 

GGCCGGTTCCCTATGATGGGCATCGGACAGATGTTAAGGAAGCGCCATCAGAGTCTGCAG 

CCATCTGCAGATAGGCCCCTGGAGGCCAGCCTGCCCCCACTGCAGCCCCTAGCCCCTGTG 

AACCTTGCCTTTGACATGGCCGATGGGGTCAAGACCCAGTGCTAA 

SEQ ID NO: 41_Z3 6720_H 

ATGGACACAAAGCTGAACATGCTGAACGAGAAGGTGGACCAGCTCCTGCACTTCCAAGAA 
GATGTCACAGAGAAGTTGCAGAGCATGTGCCGAGACATGGGCCACCTGGAGCGGGGCCTG 
CACAGGCTGGAGGCCTCCCGGGCACCGGGCCCGGGCGGGGCTGATGGGGTTCCCCACATT 
GACACCCAGGCTGGGTGGCCCGAGGTCCTGGAGCTGGTGAGGGCCATGCAGCAGGATGCG 
GCCCAGCACGGTGCCAGGCTGGAGGCCCTCTTCAGGATGGTGGCTGCGGTGGACAGGGCC 
ATCGCTTTGGTGGGGGCCACGTTCCAGAAATCAAAGGTGGCGGATTTCCTCATGCAGGGG 
CGTGTGCCCTGGAGGAGAGGCAGCCCAGGTGACAGCCCTGAGGAGTGGGTAAAAGAGGAG 
GAGGTCTGTTTCATGCCTCCAGTTCCCCCAGCTCCGGGGGCAGCAGGACAGAGCCTGCAG 
AAGGATAAGGGGGAGCTGTCTGCCGAGCAGGGGATCTGGGCCACATTGATGACGCTGGTG 
ATCATGGTGACAGCGGCAAATAAAGAGCGAGTGGAAGAAGAGGGAGGAAAACCAAAGCAT 
GTGCTGAGCACCAGTGGGGTGCAGTCTGATGCCAGGGAGCCTGGGGAAGAGAGCCAGAAG 
GCGGACGTGCTGGAGGGGACAGCGGAGAGGCTGCCCCCCATCAGAGCGTCAGGGCTGGGA 
GCTGACCCCGCCCAGGCAGTGGTCTCACCGGGCCAGGGAGATGGTGTTCCTGGCCCAGCC 
CAGGCATTCCCTGGCCACCTGCCCCTGCCCACAAAGGTGGAAGCCAAGGCTCCTGAGACA 
CCCAGCGAGAACCTCAGGACTGGCCTGGAATTGGCTCCAGCACCCGGCAGGGTCAATGTG 
GTCTCCCCGAGCCTGGAGGTTGCACCAGGTGCAGGACAAGGAGCATCGTCCAGCAGGCCT 
GACCCTGAGCCCTTAGAGGAAGGCACGAGGCTGACTCCAGGGCCTGGCCCTCAGTGCCCA 
GGGCCTCCAGGGCTGCCAGCCCAGGCCAGGGCAACCCACAGTGGTGGAGAAACACCTCCA 
AGGGCAGCCCTGCTGAAGGGCGCTGTGGCCCCGGGCTTCTCTCGGAGGGACCTGGTGTTT 
CCTAGCATCTTCTGCGCCTGCCTAGGGATCTCCATCCACATACAAGAGATGGATACTCCT 
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GGGGAGATGCTGATGACAGGCAGGGGCAGCCTTGGACCCACCCTCACCACAGAGGCTCCA 

GCAGCTGCCCAGCCAGGCAAGCAGGGCCCACCTGGGACCGGGCGCTGCCTCCAAGCCCCT 

GGGACTGAGCCCGGAGAACAGACCCCTGAAGGAGCCAGAGAGCTCTCCCCGCTGCAGGAG 

AGCAGCAGCCCCGGGGGAGTGAAGGCAGAGGAGGAGCAAAGGGCTGGGGCCGAGCCTGGC 

ACGAGACCAAGCTTGGCCAGGAGTGACGACAATGACCACGAGGTTGGGGCCCTGGGCCTG 

CAGCAGGGCAAAAGCCCAGGGGCGGGAAACCCTGAGCCTGAGCAGGACTGTGCAGCCAGG 

GCTCCGGTGAGAGCTGAAGCAGTAAGGAGGATGCCCCCAGGCGCCGAGGCTGGCAGCGTG 

GTTCTGGATGACAGTCGGGCCCCACCAGCTCCTTTTGAACACCGGGTAGTGAGCGTCAAG 

GAGACCTCCATCTCTGCGGGTTACGAGGTGTGCCAGCACGAAGTCTTGGGAGGGGGTCGG 

TTTGGCCAGGTCCACAGGTGCACAGAGAAGTCCACAGGCCTCCCACTGGCTGCCAAGATC 

ATCAAAGTGAAGAGCGCCAAGGACCGGGAGGACGTGAAGAACGAGATCAACATCATGAAC 

CAGCTCAGCCACGTGAACCTGATCCAGCTCTATGACGCCTTCGAGAGCAAGCACAGCTGC 

ACCCTTGTCATGGAGTACGTGGACGGGGGTGAGCTCTTCGACCGGATCACAGATGAGAAG 

TACCACCTGACTGAGCTGGATGTGGTCCTGTTCACCAGGCAGATCTGTGAGGGTGTGCAT 

TACCTGCACCAGCACTACATCCTGCACCTGGACCTCAAGCCGGAGAACATATTGTGCGTC 

AATCAGACAGGACATCAAATTAAGATCATTGACTTTGGGCTGGCCAGAAGGTACAAGCCT 

CGAGAGAAGCTGAAGGTGAACTTCGGCACTCCTGAGTTCCTGGCCCCAGAAGTCGTCAAT 

TATGAGTTTGTCTCATTCCCCACAGACATGTGGAGTGTGGGAGTCATCACCTACATGCTA 

CTCAGTGGCTTGTCCCCATTTCTAGGGGAAACAGATGCAGAGACCATGAATTTCATTGTA 

AACTGTAGCTGGGATTTTGATGCTGACACCTTTGAAGGGCTCTCGGAGGAGGCCAAGGAC 

TTTGTTTCCCGGTTGCTGGTCAAAGAGAAGAGCTGCAGAATGAGTGCCACACAGTGCCTG 

AAACACGAGTGGCTGAATAATTTGCCTGCCAAAGCTTCAAGATCCAAAACTCGTCTCAAA 

TCCCAACTACTGCTGCAGAAATACATAGCTCAAAGAAAATGGAAGAAACATTTCTATGTG 

GTGACTGCTGCCAACAGGTTAAGGAAATTTCCAACTTCTCCCTAA 

SEO ID NO: 4 2 SGK088_H 

GGGGAGATGGCGCTGTTTGAGTGCCTGGTGGCGGGGCCCACTGACGTGGAGGTGGATTGG 

CTGTGCCGTGGCCGCCTGCTGCAGCCTGCACTGCTCAAATGCAAGATGCATTTCGATGGC 

CGCAAATGCAAGCTGCTACTTACATCTGTACATGAGGACGACAGTGGCGTCTACACCTGC 

AAGCTCAGCACGGCCAAAGATGAGCTGACCTGCAGTGCCCGGCTGACCGTGCGGCCCTCG 

TTGGCACCCCTGTTCACACGGCTGCTGGAAGATGTGGAGGTGTTGGAGGGCCGAGCTGCC 

CGTTTCGACTGCAAGATCAGTGGCACCCCGCCCCCTGTTGTTACCTGGACTCATTTTGGC 

TGCCCCATGGAGGAGAGTGAGAACTTGCGGCTGCGGCAGGACGGGGGTCTGCACTCACTG 

CACATTGCCCATGTGGGCAGCGAGGACGAGGGGCTCTATGCGGTCAGTGCTGTTAACACC 

CATGGCCAGGCCCACTGCTCAGCCCAGCTGTATGTAGAAGAGCCCCGGACAGCCGCCTCA 

GGCCCCAGCTCGAAGCTGGAGAAGATGCCATCCATTCCCGAGGAGCCAGAGCAGGGTGAG 

CTGGAGCGGCTGTCCATTCCCGACTTCCTGCGGCCACTGCAGGACCTGGAGGTGGGACTG 

GCCAAGGAGGCCATGCTAGAGTGCCAGGTGACCGGCCTGCCCTACCCCACCATCAGCTGG 

TTCCACAATGGCCACCGCATCCAGAGCAGCGACGACCGGCGCATGACACAGTACAGGGAT 

GTCCATCGCTTGGTGTTCCCTGCCGTGGGGCCTCAGCACGCCGGTGTCTACAAGAGCGTC 

ATTGCCAACAAGCTGGGCAAAGCTGCCTGCTATGCCCACCTGTATGTCACAGATGTGGTC 

CCAGGCCCTCCAGATGGCGCCCCGCAGGTGGTGGCTGTGACGGGGAGGATGGTCACACTC 

ACATGGAACCCCCCCAGGAGTCTGGACATGGCCATCGACCCGGACTCCCTGACGTACACA 

GTGCAGCACCAGGTGCTGGGCTCGGACCAGTGGACGGCACTGGTCACAGGCCTGCGGGAG 

CCAGGGTGGGCAGCCACAGGGCTGCGTAAGGGGGTCCAGCACATCTTCCGGGTCCTCAGC 

ACCACTGTCAAGAGCAGCAGCAAGCCCTCACCCCCTTCTGAGCCTGTGCAGCTGCTGGAG 

CACGGCCCAACCCTGGAGGAGGCCCCTGCCATGCTGGACAAACCAGACATCGTGTATGTG 

GTGGAGGGACAGCCTGCCAGCGTCACCGTCACATTCAACCATGTGGAGGCCCAGGTCGTC 

TGGAGGAGCTGCCGAGGGGCCCTCCTAGAGGCACGGGCCGGTGTGTACGAGCTGAGCCAG 

CCAGATGATGACCAGTACTGTCTTCGGATCTGCCGGGTGAGCCGCCGGGACATGGGGGCC 
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5CTCGGTCACATTGGAGCTG 



ctcacctgcaccgcccgaaaccg™ 

rCACCAGGAGATCGGCAGGGGTGCTTTCTCCTA^ 

CGGCCTGGAGTTTGCGGCCAAGTTCATCCC^ 
TCGGGAGGCCCGGCTGCTGGCCAGGCTCCAGCA^ 
CTTCGAGAGGCGCCGGGGACTGGTCATTGTCACC^ 

ocgaatcgccaggaaacccaccgtgtg™ 



IgCTCCGGCCTGGAGTTTGCGGCCAAOH^^— — — CTGTGTCC TCTACTTCCAT 

gcgcgtcgggaggcccggctgctggccaggctcca 

gaggccttcgagaggcgccggggactggt^^^ 
ctggagcgaatcgccaggaaacccaccgtgtg^ 
caggtgctagagggaatacactacctgcaccagagcca^ 
cctgagaacctgctggtgtgggatggtgctgcgggcgag^ 



cLgaccaggaggctcccagcccagaggccctc^ 
ggggctagccccaggcggggagagctccgcag^ 
cgggccgggccgcgggagctgggccggg^ 
cagcgccggagccccggcccgggagccac^ 



fcfoTCCOAGCCTCCAAGCCTGCACCACC^^^ 
CTCA( 

CCCTTCCGTGGGGCCGAGGAGGAGGATGGCATATACCGGCCCA^ 



CTCAGACcSa^GATCATTCAG^^^^ 

'-^Sgggcgtggcgcgggggcttcagaggccaagc^^ 
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FIGURE 2HH 

GGCCGCAGCACGCCGCTGTTCGGACGGCTTCGCAG^ GCGCCACCA CGCCTTCC 
CGGCGCCTTGGCCTTCCGCAC^ 



GCCGAGTCCCTGGGCTCCGAGGCCAGLUCu^o^^^- CCACCA 

cgaSccggctccgctggggc^ 

AACCTCTCTGCCAGCGTCCAGGAGGAGTTGGGTCACCA^l GGGAGGC AGCC 

^ccccccagtcttccacatcaaactcaaggaccag^ 
Iccctgctctgcctgccagcggcctgccctgcacc^ 

A .±^^^-r^ a r.r,TrAGAGCCCTCAGTGATCATCGTGTCCTGCA^ 



CTGACTGTGAGGTTCGGTGTGGCGTG^^ 

CCGATCCCAGCCTCCACTCCTCAAGGGGTTA^C^ 
GTGGTGACTTCCTTTGTGTCTGCACCACCAGCCC^ 

rrTGAGCCTACCAAGGTGACTGTGCAGAGCCTCAGCCL.^ CGACAGGGT CCC 
TATGAGCCAGACCCCCAGGAA^CGGAGGCTC^ 

^<aT(^GCTOCGCCGCCA^^CACC^C^ ^^^^ 
^^^^ 

CGG SfAGGGGGTGGAGG T G.XGGGA = 



GAC 



AAAAGGAATCGAGGGACAGGAAGGGGGAG^ 

61-1 II s ) 
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GAGAGCTGGAGGCCAGCAGTCACTCACACTCGCTCTGTCCTCCTGTCCAGTGGATACAGC 
CCTGGGCGCTCTGCTGGCCCAAGGATGTCCCCACTGCCCCTCCATGGCCTTTGGCCTTCT 
TCCCATTCATATTTATTTATTTATTGACTTTTATGAAGTTTCCCCTTCCATCCGATCCCT 
ACTGCCCATGTTGTCCTGACCATCCCTCCCAGCCATCCAGCTGTCTGTCTGTCTGCCACA 

AGGAAATAAAAATGGCAAGCAGCAAAAAAA 
qFO ID NO: 43 AA542015 M SGK088_M 

GCCACGGACATCTGGGGAGCGGGTGTGCTCACCTACATCATGCTTAGTGGGTACTCCCCA 

TTCTATGAGCCAGACCCCCAGGAAACAGAGGCTCGGATTGTTGGGGGTCGCTTTGATGCC 

TTCCAGTTGTATCCTAACACATCCCAGAGTGCCACCCTCTTCTTGAGAAAGGTCCTCTCA 

GTACATCCCTGGAGCCGGCCCTCTCTGCAGGACTGCTTGGCCCACCCATGGCTGCAAGAT 

GCCTACCTGATGAAGCTGCGCCGCCAGACACTCACCTTCACCACCAACCGGCTCAAGGAA 

TTCCTGGGCGAGCAGCGGCGACGTCGGGCTGAGGCTGCTACCCGTCACAAGGTGCTGCTC 

CGCTCCTACCCTGGCAGCCCCTAGGTGGCACAGACCGCAGCCCGGCCACGGGCTTCAACT 

TrrCTTCTCACTCGCGCTGCCAAGGGACATTCCAGAGCCCATGCTGAGCTGGACAGGCAG 

GGGCTTCAGATACCAGCAGCAGCAGCAGCAGCAGCAGCAGCAACATCTGGCTGGGCTATT 

ACCTCATGGACCTAAGAGGACAAGGCCCTGGGGCTTCAGCCGAATGTCACCCCGGCCATA 

ACCAGAGCAGGAGACCCACTGGCCAGGCTGGGCAAGGGTGAGAGCAGAAAGAGGCAAAGA 

GGGAGTTGGGAAGTGAAGAATGAGACGGAGGATAGAGAGGGAGGAGTTTGAGGAAGGTTC 

TAGGCTGGAGTGGAATGCTATATCTCAGGGAGAAGCCAGAAGGGGACATGGCTGAAGAGG 

AACAAGGACCCTGTGATGTGGGAATGTGGTGGAGAGGAGGACTGGACATAGAGAGTGTGC 

CAGGAGCCAGAGCAGAGACATAAGGGAGGGCAGAAGGGTAGAAGGCAACAGGAGTGGGCT 

CAGGGGTGGCAGGGCAGGCCAGCAGCTGCATCTTCAGAAAGAGAGAGGAGAAAGGCAAAG 

AGACGAAAGGCCGCTCCAGCTGGTCTCCTGTCCCAGCCGATGCAGTTCTGGGCGTTCTCC 

aSggcccagggatgtcctcactgctcctccatggcctttgccctccttcccatttgtat 

TTATTTATTTATTGCCTTTTGTGGAGTTTCCTTTCTATCCAGTCCCTAGTGCCTATGTTG 
TCCCGACCATCCCCCTTCAGTCACCCAGCTGTCTGTGCAGCTGTCTGTCTGTCTGTCACA 
AGGAAAATAAAACAAAACAAAACAAAACAAAACAAACAAAAAAAAAACAGC 

cpn TD NO- 44 R19772 H 

ATGAAGGGCGGCGACAGGGCTTACACCCGAGGTCCCTCTTTGGGGTGGCTCTTTGCTAAG 
TGCTGCTGTTGCTTCCCGTGTAGAGATGCATACTCTCATTCCTCAAGCGAGAATGGAGGC 

IIgtccgagtccgtagccaacctgcaggcccagccctccctgaacttcatccacagttcc 

CCGGGTCCCAAGCGCTCCACCAACACTCTTAAGAAGTGGCTGACGAGTCCTGTGCGTCGG 
CTCAACAGCGGGAAGGCAGATGGAAACATCAAAAAGCAGAAGAAAGTTCGCGATGGTCGG 
AAGAGCTTTGACCTGGGATCTCCCAAGCCTGGGGATGAAACAACCCCTCAGGGAGACAGC 
GCTGATGAGAGCAAGAAAGGTTGGGGTGAAGATGAGCCGGATGAAGAGTCACACACACCC 
CTCCCACCACCTATGAAGATTTTTGACAACGACCCTACACAGGATGAAATGTCCTCCTCT 

Sgctagcagcccggcaggcttccactgaagtacctactgctgcagaccttgtcaatgca 

ATAGAAAAGTTGGTCAAAAACAAGCTGAGTCTAGAAGGAAGCTCATACCGGGGGAGCTTG 
AAAGACCCTGCAGGCTGCCTGAATGAGGGGATGGCCCCACCCACACCTCCTAAAAATCCA 
GAAGAAGAACAGAAAGCCAAGGCCCTGAGAGGCAGGATGTTTGTCCTGAATGAGCTGGTA 
CAGACAGAGAAAGACTATGTCAAGGATCTGGGCATTGTGGTGGAGGGCTTCATGAAGAGA 
ATAGAAGAAAAGGGTGTCCCTGAGGATATGCGAGGAAAGGACAAAATCGTGTTTGGAAAT 
ATTCATCAGATTTATGACTGGCATAAGGATTTTTTCCTGGCGGAACTGGAAAAGTGTATC 
CAGGAGCAAGACAGATTGGCACAGCTCTTTATTAAGCACGAGCGGAAGCTGCACATCTAC 
GTGTGGTATTGTCAGAATAAGCCGCGCTCAGAGTACATCGTTGCTGAGTATGACGCCTAC 
TTTGAGGAGGTAAAACAGGAGATAAATCAGAGGCTGACACTGAGTGACTTCCTCATCAAG 
CCCATTCAGAGAATAACAAAATACCAGTTGCTCCTCAAGGACTTCCTGAGATACAGTGAG 
AAGGCTGGTTTGGAGTGTTCAGATATCGAGAAAGCAGTGGAGTTAATGTGCCTTGTTCCC 
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AAACGCTGCAATGACATGATGAATCTAGGACGTCTGCAGGGCTTTGAGGGCACTCTGACT 

GCTCAGGGGAAGCTACTGCAGCAGGACACATTCTATGTGATCGAGCTGGATGCAGGCATG 

CAGTCCCGGACCAAAGAGAGGCGCGTGTTCCTCTTCGAGCAGATTGTCATCTTCAGTGAA 

CTGCTCAGGAAGGGATCCCTCACCCCTGGCTACATGTTCAAAAGGAGCATCAAGATGAAT 

TACTTGGTCCTGGAGGAGAATGTGGACAATGATCCCTGCAAGTTTGCACTCATGAACAGA 

GAGACTTCTGAGAGGGTTGTTCTGCAAGCCGCCAACGCTGACATCCAGCAGGCCTGGGTG 

CAGGACATCAATCAAGTCTTAGAAACACAGCGAGACTTTTTGAATGCACTGCAATCGCCC 

ATTGAGTATCAACGGAAAGAAAGGAGCACAGCTGTGATGAGGTCTCAACCTGCCAGGCTT 

CCCCAAGCCAGCCCCAGGCCCTACTCCTCTGTTCCTGCGGGCTCAGAGAAGCCCCCAAAG 

GGCTCCAGCTATAACCCACCTCTGCCTCCCCTGAAGATATCTACCTCCAATGGCAGTCCA 

GGGTTTGAATACCACCAGCCTGGGGACAAGTTCGAAGCCAGCAAGAACGACCTGGGAGGC 

TGCAATGGGACCTCGTCCATGGCCGTGATCAAAGATTACTATGCACTGAAGGAGAATGAA 

ATCTGTGTGAGCCAAGGTGAGGTGGTCCAGGTCCTCGCCGTCAACCAGCAGAACATGTGT 

CTGGTGTACCAGCCTGCCAGCGACCATTCCCCCGCCGCCGAGGGCTGGGTCCCAGGCAGC 

ATCCTGGCGCCCCTCACCAAAGCCACAGCAGCAGAAAGTAGTGACGGGAGCATCAAGAAG 

TCATGTTCATGGCATACTCTACGCATGAGAAAGCGGGCGGAAGTGGAGAACACGGGTAAA 

AATGAAGCCACAGGGCCTCGTAAACCCAAGGATATTCTGGGCAACAAAGTCTCTGTTAAA 

GAGACGAACAGTTCCGAGGAATCAGAGTGTGATGATCTTGACCCTAATACTAGCATGGAG 

ATCTTAAATCCAAATTTCATCCAAGAAGTGGCCCCAGAATTCCTTGTGCCCTTGGTGGAT 

GTGACCTGCTTGCTTGGGGACACAGTGATACTGCAGTGCAAAGTCTGTGGGCGGCCAAAG 

CCCACCATCACTTGGAAGGGTCCAGACCAGAACATCCTTGACACTGATAACAGCTCAGCC 

ACATACACGGTCTCCTCTTGTGATTCTGGAGAAATCACCCTGAAGATCTGTAATCTGATG 

CCCCAAGACAGTGGGATTTATACCTGCATAGCAACAAATGACCACGGGACCACATCAACG 

TCTGCAACAGTCAAAGTGCAAGGTGTTCCAGCAGCCCCTAACCGCCCCATTGCCCAGGAG 

AGAAGCTGCACCTCCGTGATTCTCCGCTGGCTGCCCCCCTCCAGCACAGGAAACTGCACT 

ATTTCTGGTTACACTGTGGAGTACAGAGAGGAAGGTTCTCAGATCTGGCAGCAGTCAGTG 

GCTTCGACCTTGGACACTTACCTCGTCATCGAAGACCTTAGTCCCGGGTGTCCTTATCAG 

TTCAGAGTCAGTGCCAGTAACCCCTGGGGAATCAGCCTTCCCAGCGAGCCCTGGGAGTTT 

GTGCGACTTCCAGAATACGATGCTGCTGCTGATGGTGCCACCATTTCTTGGAAGGAAAAT 

TTTGACTCAGCTTACACTGAGCTGAATGAAATTGGAAGAGGCCGTTTCTCTATAGTAAAG 

AAATGCATTCACAAAGCTACCCGCAAAGATGTGGCTGTGAAATTTGTTAACAAAAAAATG 

AAGAAGAAAGAACAGGCTGCCCACGAGGCTGCCCTGCTTCAGCACCTACAGCACCCCCAG 

TACATCACTCTCCATGACACCTATGAGTCCCCCACATCCTACATCCTGATCTTGGAACTG 

ATGGATGATGGCCGGCTCTTAGACTACCTTATGAATCATGATGAACTGATGGAGGAAAAA 

GTAGCTTTCTATATCCGAGACATCATGGAGGCTCTGCAGTACCTTCACAACTGCAGGGTT 

GCACATTTGGACATAAAGCCTGAAAACCTGCTCATTGACCTACGGATTCCAGTGCCTCGA 

GTGAAGCTCATTGACTTGGAGGATGCTGTCCAGATCTCGGGTCACTTCCACATTCACCAC 

CTGCTGGGGAACCCTGAGTTTGCTGCCCCAGAAGTCATTCAAGGCATCCCCGTCTCCCTG 

GGGACAGACATCTGGAGCATCGGGGTTCTGACATATGTCATGCTGAGTGGGGTCTCCCCC 

TTCTTGGATGAGAGCAAAGAGGAGACATGTATCAACGTATGCAGGGTGGATTTCAGCTTC 

CCCCATGAATACTTCTGTGGTGTGAGCAATGCTGCCAGAGATTTCATCAATGTGATCTTA 

CAGGAAGATTTTCGGAGGCGGCCCACAGCAGCCACATGCTTGCAGCATCCATGGCTGCAG 

CCCCATAATGGCAGCTACTCTAAGATCCCCCTGGACACCTCCCGCCTAGCATGCTTCATA 

GAACGTCGCAAGCACCAGAATGATGTGCGGCCTATCCCCAATGTCAAGAGCTACATTGTC 

AACCGGGTGAACCAAGGGACGTAG 
SEO ID NO: 4 5 5R72 8 2_H 

CGCCGCTGTTTGTCCTCGCGCGGCCCCGTCCACTGCCCTGCGGTTGCTCTGCGGGCTGAA 
AAGTTTCTCCCGGTGCAGAATTCCGGGCTCAGCGACAGCCTGCGCCGAGTGTGCGCACCT 
GTCGGAGACCCGCCAGTCCGCCGGCCCCGGCTTTGTTCGTGCGGAACTGTAGTGGTGAGA 
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AAAACTCCATGTCTGGGCACGCCTGGCTGATCTTCACCTCTTTCTTCTAGGACCTTCCTC 
TGGGCTGTCACGTGTGAATATGTGTCTAGTGCATCCTTAACCTGAGGACTTCACCAGTTC 
GAAATTACAGTTTTCACCATCAACTACCTTATCCTTTTTGGCCTGGTTTTCTTCCTCAAA 
CAGTGGAAACATTTTTAAAGTTGCTTTTGTTGCAGAGTTAAACAAATGGCTGATAGTGGC 
TTAGATAAAAAATCCACAAAATGCCCCGACTGTTCATCTGCTTCTCAGAAAGATGTACTT 
TGTGTATGTTCCAGCAAAACAAGGGTTCCTCCAGTTTTGGTGGTGGAAATGTCACAGACA 
TCAAGCATTGGTAGTGCAGAATCTTTAATTTCACTGGAGAGAAAAAAAGAAAAAAATATC 
AACAGAGATATAACCTCCAGGAAAGATTTGCCCTCAAGAACCTCAAATGTAGAGAGAAAA 
GCATCTCAGCAACAATGGGGTCGGGGCAACTTTACAGAAGGAAAAGTTCCTCACATAAGG 
ATTGAGAATGGAGCTGCTATTGAGGAAATCTATACCTTTGGAAGAATATTGGGAAAAGGG 
AGCTTTGGAATAGTCATTGAAGCGACAGACAAGGAAACAGAAACGAAGTGGGCAATTAAA 
AAAGTGAACAAAGAAAAGGCTGGAAGCTCTGCTGTGAAGTTACTTGAACGAGAGGTGAAC 
ATTCTGAAAAGTGTAAAACATGAACACATCATACATCTGGAACAAGTATTTGAAACGCCA 
AAGAAAATGTACCTTGTGATGGAGCTTTGTGAGGATGGAGAACTCAAAGAAATTCTGGAT 
AGGAAAGGGCATTTCTCAGAGAATGAGACAAGGTGGATCATTCAAAGTCTCGCATCAGCT 
ATAGCATATCTTCACAATAATGATATTGTACATAGAGATCTGAAACTGGAAAATATAATG 
GTTAAAAGCAGTCTTATTGATGATAACAATGAAATAAACTTAAACATAAAGGTGACTGAT 
TTTGGCTTAGCGGTGAAGAAGCAAAGTAGGAGTGAAGCCATGCTGCAGGCCACATGTGGG 
ACTCCTATCTATATGGCCCCTGAAGTTATCAGTGCCCACGACTATAGCCAGCAGTGTGAC 
ATTTGGAGCATAGGAGTCGTAATGTACATGTTATTACGTGGAGAACCACCCTTTTTGGCA 
AGCTCAGAAGCGAAGCTTTTTGAGTTAATAAGAAAAGGAGAACTACATTTTGAAAATGCA 
GTCTGGAATTCCATAAGTGACTGTGCTAA7^AGTGTTTTGAAACAACTTATGAAAGTAGAT 
CCTGCTCACAGAATCACAGCTAAGGAACTACTAGATAACCAGTGGTTAACAGGCAATAAA 
CTTTCTTCGGTGAGACCAACCAATGTATTAGAGATGATGAAGGAATGGAAAAATAACCCA 
GAAAGTGTTGAGGAAAACACAACAGAAGAGAAGAATAAGCCGTCCACTGAAGAAAAGTTG 
AAAAGTTACCAACCCTGGGGAAATGTCCCTGAGACCAATTACACTTCAGATGAAGAGGAG 
GAAAAACAGTCTACTGCTTATGAAAAGCAATTTCCTGCAACCAGTAAGGACAACTTTGAT 
ATGTGCAGTTCAAGTTTCACATCTAGCAT^ACTCCTTCCAGCTGAAATCAAGGGAGAAATG 
GAGAAAACCCCTGTGACTCCAAGCCAAGGAACAGCAACCAAGTACCCTGCTAAATCCGGC 
GCCCTGTCCAGAACCAAAAAGAAACTCTAAGGTTCCCTCCAGTGTTGGACAGTACAAAAA 
CAAAGCTGCTCTTGTTAGCACTTTGATGAGGGGGTAGGAGGGGAAGAAGACAGCCCTATG 
CTGAGCTTGTAGCCTTTTAGCTCCACAGAGCCCCGCCATGTGTTTGCACCAGCTT/^AAAT 
TGAAGCTGCTTATCTCCAAAGCAGCATAAGCTGCACATGGCATTAAAGGACAGCCACCAG 
TAGGCTTGGCAGTGGGCTGCAGTGGAAATCAACTCAAGATGTACACGAAGGTTTTTTAGG 
GGGGCAGATACCTTCAATTTAAGGCTGTGGGCACACTTGCTCATTTTTACTTCAAATTCT 
TATGTTTAGGCACAGCTATTTATAGGGGAAAACAAGAGGCCAAATATAGTAATGGAGGTG 
CCAAATAATTATGTGCACTTTGCACTAGAAGACTTTGTTAGAAAATTACTAATAAACTTG 
CCATACGTATTACAGCAGAAGTGCTTCAGTCATTCACATGTGTTCGTGAGATTTTAGGTT 
GCTATAGATTGTTTAAGACAGCTTATTTTAAATGTAGAAAAATAGGAGATTTTGTAACTG 
CTTGCCATTAACTTGCTGCTAAATTCCCAATGTATTGATTAAATCAATAAAAAACAGATG 

TTACTC 

SEQ ID NO: 4 6_SGK3 09_H 

GGGTCCGCAGCCCGCCCTCACAGGCCCTCCTCACTCCCCTAGGTAGATGGCCCCCTCAGG 
GCAGGCCCGGCGGACACCCCTCCCTCTGGCTGGCGGATGCAGTGCCTAGCGGCCGCCCTT 
AAGGACGAAACCAACATGAGTGGGGGAGGGGAGCAGGCCGACATCCTGCCGGCCAACTAC 
GTGGTCAAGGATCGCTGGAAGGTGCTGAAAAAGATCGGGGGCGGGGGCTTTGGTGAGATC 
TACGAGGCCATGGACCTGCTGACCAGGGAGAATGTGGCCCTCAAGGTGGAGTCAGCCCAG 
CAGCCCAAGCAGGTCCTCAAGATGGAGGTGGCCGTGCTCAAG7VAGTTGCAAGGTTCGGGC 
CTCGGGCAGGGGGATGGGAAGGAAGAGATGATGAAGCCAGGGGCTAAGAGAGGGAAGGAC 




WO 00/73469 



PCT/USOO/14842 



FIGURE 2LL 

TACAGGAAGTGCTATATGCTGGACTTCGGGCTGGCCCGGCAGTACACCAACACCACGGGG 

gaS?gc^gccccctcggaatgtggccgggtttcgagg^ 

"SSgLccgggagatgggccgccacgacgacctg 
ctggtggagtttgcagtgggccagctgccctggaggaagatcaaggacaaggaacaggta 

gggatca?caIggLaagtatgagcaccgg 

ScSSSg^ 
gIctSL^^^ 

SaaSccggatcaacatcggcaaagtaactgccgccagg 
tcgSgStcttccatcatcatcatcctctgtctccttcacactgaggagaccatccgcc 

ggcgcgccagatatcacacgtgcca^ 

gag?aggS2tcgca^^ 

ag?agc?g?cgccgccgc 

^SgS^tcgaIgccgcgctcgggtcccgccgcagtcggcggtgggggatgggcaggca 
SggcggSccS 

rrCCTCCTCCAGAAACCCCTGCGTCCGCCACGGCCCAGGTTAAATGGAAACCACCCTTGG 

gSSgSSc^gtagctgttctaccatatcagtgtattgcaatgagtggg^ 
ga^cIgSggatatcctgagtgttggaatcctagtgaa^^ 

AarATTGGGGGTGGGGGCTTTGGAGAAATTTACGATGCCT 

^TGTTTTGAAAAAGCTGCAAGGGAAAGACCATGTTTGTAGATTTATTGGCTGTGGGAGG 

SgScga^ctLgtggtcatgcagttgcagggtcggaatctggcagatcttcgc 
SggaStgaaagcattca^^^ 

™SgggaSgttcgttat^ 
tcgagaaaaataaaggacaaggagcaagtaggctctattaagg 

rTPRTrTTGAAACATCTCCCTCCAGAATTCAGCATCTTTCTAGACCATATCTCTTCTTTG 
GATTATTTTA^AAAACCAGACTACCAGCTTCTTACATCCGTGTTTGACAATAGCATCAA 

acS™ISta^gLagtgacccttttgactgggagaagactgg^ 

^ScCACCACTACTTCTACCACCCCTCAGTTGCACACTCGCTTGACCCCTGC 
ATTCGA^TTGCCAATGCTACTCCCATCCCTGGAGACTTGCTTCGAGAAAATACAGATG^ 

pI™™cctggatctctgggacacccccgtccccaggagaag 
atgga^gccaacaaaaacaagataaagcttggaatttgtaaggctgc 

AACAGCCATG^CCAG^CAAATGGTCTTCTCAATGCTCCAAGCCTTGGGTCACCAATTCGT 
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GTCCGCTCAGAGATTACTCAG^TC^ 
ATTCACAGCTTTGAGCTGGAAAA^ 

ttccttgagacctggtataaaatagtgta™ 

ATTATTGTTGTCATTGTTGTTATTATTATTGT^^^ 
TAAGAAATCCTTGCATTTTTGTAACACTGA^ 
TTTAATTATTATTATCCTGATTCTTAATT^ 
TGGAAGCTGAGCAGTAAGAGCTGATGACTTCCTAAC^ 

TTTTACAGTAACTTTGTTTAGAAAGT^ 

GGCATGATAAAGATGGCATTAG^ 

AATTCATACACAAGGGACGGTGAGTTCA^ 

ATACTG TGGGAA^CAGATAC™ 

SEQ ID NO: 48 - AA43 "^-" rTrTTTTTGAT TTAGAGGCGAAATCAGCACGTGGT 

acttttactatattctttgagatgac^ 

GGCTCAAATCTCCTTATGGATAGTGTTTCTTCCTT^ 
CGGGGCCTGGCGTACATCCACCACCAACACGTTOT^ 

cagtttagcaaatgctggtgaaaagaaagggcgagat^ 

SEQ ID NO: 4 9_AA626859 H TATGGCAAAC ACTTCAAGCTCTTAA 

aaatggagttgctgatggagtgatcaa^gcgtatta^ 

TTTCTGTCATATACATAACTGTATTCACAGAGATATAAAAC GATT CCAGGAGA 

^Lgcaaggaataatcaagatttgt^^ 

TGCCTACACCGATTATGTAGCTACGAGATGGTAC^ 
TACTCAGTATGGTTCTTCAGTCGATATATGGGCTATTG^ 

GACAGGCCAGCCACTGTGGCCTGGAAAATCA 

AACACTAGGAAAATTAATCCCAA^ 

TGGCATCAGTATACCTGAGCCAGAAGACATGGAAAC GAA TCCAGATGACAGATT 
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TGAAAATTAAGATGCCTTCTAGAATTGGTTTGCTCTGATCATTGCTGATTCCTTTCCCCA 
TGCTTTTACAT 

CATCGAGGTGTTCAGAAGAAAGAGA^GATGCATCTAOT^^ 
TGTAAAACCTGAAAACATCCTAATAACCAAGCA^GGGATGAT 

GCGCCAGCAGAATCAACTGCTGCCTCTTATTC^ 

ScTgS 

CATACCACAGTAACGCCCCTGGATCCCTGGCT^ 

Statctaaggatct^a^g^™^^^^ 
tccgtgtttgtatcaaggggc^ 



ccaccggccatttctctaaatotcattctg mtatttgaaattgtagctattgttaa 



TTCTTTAAATTATTTT. 



•AATAAAAGCCAGAAACATT 
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GCATCCCTGGGCACAGTTATC^CCT^GTGGAGT^ 
GACACCTTCTCCGATGACATGGCCTTCA^CTAGAC^^ 
GGATCAGATCGGAGCGACCGCCTGCA^C 



GACTTACTAAAAGCTAAACAGACC^^ 
GGATCGATGAAGGACCGGATATCGGGAAGTTC^^ 

TATGGGAAGGCGCAGGTAGCCAAA^^ 
AAGGAGAAGACCAGGAAAG^CGGGAGCTGA^ 

AGGAGCCCCCACAGGAA(^^^^^ TACCTCGAG cAATTATGAC 
TCTTATGGCCAAGATTATGACCTTAGTCCCTCAC^AJ. TCA GTCCCCCTTACAAG 
. TCCTACAAGAAAAGTCCTGGAAGTACCTCGAGAA 
GAGCCTTCGGCCTACCAGTCCAGCACCCGGTC^ 

TCTGTCAGTCCCTATAGC^^ 

GGGCGATCGCCCAGTCCCTATGGTCGAAGGCGGTCCAGC^ 

TCTCTGAGTCGGAGTCCACTCCCCAGTAGGAAATCCATGAAGl GCA GTCGT 

gcatScaagaca^ 

CATTCCAGTATCTCACCTGTCAGGCTTCCACTTAA TGGATG GAAAGGAGTCC 

AGGAAAAAGAAGGAAAGAGCAGCTGCTG^ 

AAGGGTTC*CCTGTATTTTTGCCTA^^ 

GGTTTGGAGTCTAAAAAGTTACCCAGAAGTGTAAAATTG^ AGATA CAGGGAAA 

gaIctggtgaatgtaacacatctaaac^ 

GTAAAGTTGGATGAGAACTCCGAGAAGCATCTTGTlAaa w^QGAGACAGAAACA 

AGAGACTCTAAACGCATAGC^^^ 
TCAGAAAAGGAGACCCCTCCACCTCTTCCCAC^AA 

ACTACTACCCCTCCACCTCAGACACCCCCTTTGGUft TCAGGT TCCTGCTTCCAGT 
cScAGCAACCACCTCTGCCTCCTTCTCAGCCAG^^^ 

SStcaactttgccc^ 

TCTCAGCCCCCTGTACAGGTTTCTGTGAAGACTCA^ 

CACCTGAAAACTTCAACGTTGCCTCCTTTGCC 

GACATGGATAGTCCAAAAGAAACTCTTCCTTCA^ 

AGGACACGTCACTTACTCACAGACCTTCCTCTCCC1 AGCAA CCATAT A AAAAG 
TCTCCCCC A GACTCTCCAGAACCAAAGGCAATCAC A G^ actggggg 

IgIcSaaaatttgttgtcct™ 

AAACGCTGTGTGGACAAGTTTGA^ 

MGTGTICTAACATTTTGCTGARTRRCAGIGGGC^T^ c^gtcaTTACTTTGTGG 
GCTCGGCTCTATAACTCTGAAGAGAGTCGCCCTTRCftU CATJsGAT0T TTGG 

Sccgacctccagaactactgcta^^ 

AGCTGTGGATGTAITCTTG^CTATTCACAARGAA TTGTCCAGCTG TGTGG 



WO 00/73469 



PCTAJS00/14842 



FIGURE 2PP 



CGTCTACGAGAAGAA.TTCTCTTTCATTCCTTCTGCAGCACTTGATTTATTGGACCACATG 
CTGACACTAGATCCTAGTAAGCGGTGCACAGCTGAACAGACCCTACAGAGCGACTTCCTT 
AAAGATGTCGAACTCAGCAAAATGGCTCCTCCAGACCTCCCCCACTGGCAGGATTGCCAT 
GAGTTGTGGAGTAAGAAACGGCGACGTCAGCGACAAAGTGGTGTTGTAGTCGAAGAGCCA 
CCTCCATCCAAAACTTCTCGAAAAGAAACTACCTCAGGGACAAGTACTGAGCCTGTGAAG 
AACAGCAGCCCAGCACCACCTCAGCCTGCTCCTGGCAAGGTGGAGTCTGGGGCTGGGGAT 
GCAATAGGCCTTGCTGACATCACACAACAGCTGAATCAAAGTGAATTGGCAGTGTTATTA 
AACCTGCTGCAGAGCCAAACCGACCTGAGCATCCCTCAAATGGCACAGCTGGTTAACATC 
CACTCCAACCCAGAGATGCAGCAGCAGCTGGAAGCCCTGAACCAATCCATCAGTGCCCTG 
ACGGAAGCTACTTCCCAGCAGCAGGACTCAGAGACCATGGCCCCAGAGGAGTCTTTGAAG 
GAAGCACCCTCTGCCCCAGTGATCCTGCCTTCAGCAGAACAGATGACCCTTGAAGCTTCA 
AGCACACCAGCTGACATGCAGAATATATTGGCAGTTCTCTTGAGTCAGCTGATGAAAACC 
CAAGAGCCAGCAGGCAGTCTGGAGGAAAACAACAGTGACAAGAACAGTGGGCCACAGGGG 
CCCCGAAGAACTCCCACAATGCCACAGGAGGAGGCAGCAGCATGTCCTCCTCACATTCTT 
CCACCAGAGAAGAGGCCCCCTGAGCCCCCCGGACCTCCACCGCCGCCACCTCCACCCCCT 
CTGGTTGAAGGCGATCTTTCCAGCGCCCCCCAGGAGTTGAACCCAGCCGTGACAGCCGCC 
TTGCTGCAACTTTTATCCCAGCCTGAAGCAGAGCCTCCTGGCCACCTGCCACATGAGCAC 
CAGGCCTTGAGACCAATGGAGTACTCCACCCGACCCCGTCCAAACAGGACTTATGGAAAC 
ACTGATGGGCCTGAAACAGGGTTCAGTGCCATTGACACTGATGAACGAAACTCTGGTCCA 
GCCTTGACAGAATCCTTGGTCCAGACCCTGGTGAAGAACAGGACCTTCTCAGGCTCTCTG 
AGCCACCTTGGGGAGTCCAGCAGTTACCAGGGCACAGGGTCAGTGCAGTTTCCAGGGGAC 
CAGGACCTCCGTTTTGCCAGGGTCCCCTTAGCGTTACACCCGGTGGTCGGGCAACCATTC 
CTGAAGGCTGAGGGAAGCAGCAATTCTGTGGTACATGCAGAGACCAAATTGCAAAACTAT 
GGGGAGCTGGGGCCAGGAACCACTGGGGCCAGCAGCTCAGGAGCAGGCCTTCACTGGGGG 
GGCCCAACTCAGTCTTCTGCTTATGGAAAACTCTATCGGGGGCCTACAAGAGTCCCACCA 
AGAGGGGGAAGAGGGAGAGGAGTTCCTTACTAA 

SEQ ID NO: 52_AA78 92 3 9_H 

TGAAAATGGAGATGTATGAAACCCTTGGAAAAGTGGGAGAGGGAAGTTACGGAACAGTCA 
TGAAATGTAAACATAAGAATACTGGGCAGATAGTGGCCATTAAGATATTTTATGAGAGAC 
CAGAACAATCTGTCAACAAAATTGCGATGAGAGAAATAAAGTTTCTAAAGCAATTTCATC 
ACGAAAACCTGGTCAATCTGATTGAAGTTTTTAGACAGA71AAAGAAAATTCATTTGGTAT 
TTGAATTTATTGACCACACAGTATTAGATGAGTTACAACATTATTGTCATGGACTAGAGA 
GTAAGCGACTTAGAAAATACCTCTTCCAGATCCTTCGAGCAATTGACTATCTTCACAGTA 
ATAATGTAATCATTCATCGAGATATAAAACCTGAGAATATTTTAGTATCCCAGTCAGGAA 
TTACTAAGCTCTGTGATTTTGGTTTTGCACGAACACTAGCAGCTCCTGGGGACATTTATA 
CGGACTATGTGGCCACACGCTGGTATAGAGCTCCCGAATTAGTATTAAAAGATACTTCTT 
ATGGAAAGTATGTGCCTGTGGATATCTGGGCTTTGGGCTGTATGATCATTGAGATGGCCA 
CTGGAAATCCCTATCTTCCTAGTAGTTCTGATTTGGATTTACTCCATAAAATTGTTTTGA 
AAGTGNGATTCATGCCAGAACTGAAAGCTAAATTACTGCAGGAAGCAAAAGTCAATTCAT 
TAATAAAGCCAAAAGAGAGTTCTAAAGAAAATGAACTCAGGAAAGATGAAAGAAAAACAG 
TTTATACCAATACACTGCTAAGTAGTTCAGTTTTGGGAAAGGAAATAGAAAAAGAGAAAA 
AGCCCAAGGAGATCAAAGTCAGAGTTATTAAAGTCAAAGGAGGAAGAGGAGATATCTCAG 
AACCAAAAAAGA7VAGAGTATGAAGGTGGACTTGGTCAACAGGATGCAAATGAAAATGTTC 
ATCCTATGTCTCCAGATACAAAACTTGTAACCATTGAACCACCAAACCCTATCAATCCCA 
GCACTAACTGTAATGGCTTGAAAGAAAATCCACATTGCGGAGGTTCTGTGACAATGCCAC 
CCATCAATCTAACTAACAGTAATTTGATGGCTGCA7^ATCTCAGTTCAAATCTCTTTCACC 
CCAGTGTGAGGTTAACTGAAAGAGCAAAAAAGAGACGCACTTCTTCACAATCTATTGGAC 
AAGTTATGCCTAATAGCAGGCAAGAGGATCCAGGTCCTATTCAAAGCCAAATGGAGAAGG 
GTATATTTAATGAGCGAACAGGTCACAGTGACCAAATGGCAAATGAGAACT^AAAGGAAGC 
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AGTCAAAAGATACA^GGMTGG^TTAMCAG^ TGGATCAAAATC AAGAAAAAC 
AGAAAACAGAGTCATCTAAGATACCM^ 

AAGAGTTTATTCCCTOTC^G^CTGCCTGC TTTGRAGAGftAAC AG 
TTCCCGTTTTATTTATCTACAATAGAAGTGTGATpTG^G 

CTTGAA 

SEQ ID NO: 53 _ AA124976 -™.^„i ia TTrATCCTGCTGAGAGGACATCATCTACT 
CTGGCAGATATAGTTCATGCTTGTTTACA^TTGATCCTGCT 

GATCTTTTGCGTCACGATTACTTTACTAGA^ 
CTGAGAGCTAAATTATTACAGGAAGCA^ 

AAAGTCAGAGTCATTAAGGCCAAAGGGGGCAA^GGAGA 

GAGTATGAAGGCGACCACCGCCAGCAGGGCACA^ 

GACAAGAAGCCTTCTGTCTTGGAACTGAG^ 
GGTGTCAAAGAAGACCCACACGCTGGGGGTTGTATGA^^^ 

AGCAGTAATTTGTTGGCCGCAAATCTCAGTTCA^ 

ACTGAAAGAACAAAAAAGAGACGCACTT^ 

AGCAGACAAGAGGACACAGGTCCCACAC^ 

CGAACAGGTCAGAATGACCAAATATCGAGTGGG^ 

TGCGACAGGAAAGAATTCCATTTCCCTG^ 

AAAGGGATGGAAGTTAAACAGATA^^ 

TCTAAAATACCAACTTTACTTAGTATGGACCCAAAT^ AT AGTGCTTT 
GGCGATTGTGAGGGGAAGAATTT^ 
GTCTTTTAAGTAATCTTAAAAATACAAGCTT^ 
ACTAGAATGTACATAGGTTGCTGC^GATA^ 

CTATTTTTTTGGTTTTGCTAGCAAAATTTTCACAAllii CTGTCTGCT TC CTTCCTTGC 

tSStgatgctgtgatttgaaattataaagtcac^ 

TGTTCCTAGGTTAAACTCCTTGAGATG^CTOnTCCTGCA 
GTTAACAT 

SEQ ID NO: 54_AA575635_M CCRK M GGCCCGGGTC TTCTCTCCGGAT 

agcgcctcaggccagctcaagatagctgacttt 

GGTGGTCGCCTCTACACACATCAGGTGGCCACC^ 
TATGGCGCTCGGCAGTATGACCAGGGCGTTGACC^ 
GAGCTGTTGAATGGGTCCCCCCTGTTCCCG^ 
GTGCTTCGCATCCTGGGTACCCCGAGTCCTCGAGT^ 
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GCTGTGTCCTTCTCCAGGGCCACCCCTCAGTGGTGC^^ 

AGGCTCTGTCCCCTCTTCAAGGACATTGGTA^ 

TATAAGCTGTCTTCATACTGGGGACACAGCTGGGAAG^^^ 

CCACTGGGTCAGGATTTGAGGTTCATATAAAAGCCCTGGGT^ 

TTGTCTGTTGCTGTTAGGGAAAGGACAATG^^ 
CAGAAGGGGCCTCCTGTGAAGGCCATTTGGGTCCT^ 

CTTGAGTGCTCATTTGGGAGCGAGGGTCCAG^ 
(ATTGTAAACCACATCM^ 

TTGAATGAGCGAGATTATCGGGACCGGAGATA^ 
GAAGGATATGTTCCTAGACATTATCACAGAGACATT^ 
AGTAAATCTTCAGTCCGCAGCAGGAGAAGCAGTOT 
TCAAGTCATCAGTCACGTTCGNATGAAATCGTGGACACTTT^ 

AAAGTTGTAGAGTGCATTGATCATGGCATGGATG^ 

AAAAATGTAGGCCGTTACGGTGAAGCA^^ 

AATAGTAGTGATGGGAATAGTGTCTTGCG^^ 

CATGGTCATGTTTGTATTGTGTTTGAACTACTGGG^ 

GAAAAGAGGTTTCTGCCATTTCAAATTGACCACAT^ 

CTTGGTTTCACAGTCTTTCAGACTCATGATAGTAAA^AG^^^^^^^^^^^^^^^ 
ATATTAGGACCCATACCACAACACATCATTCAGAAAACAA TTAGGAGACGC TGC 
CATAACCAGCTAGATTGGGATGAACACAGTOCTGCTGGT^ 

TCTTTTTGAAATTACCATOTAAA^CTTTGAAATA^ 
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ACTTAACTTTAAAAGTTTTTCTTCTGTAATTGTTGAACGGGTGATTATTATTAACTCTAG 
ATAAGCAGGTACTAGAAACCAAAACTCAGAAAATGTTTACTGTTAGAATTCTATTAAATT 
TTAAGTGTTGTATTCTTTTTCATTGGGTGATGTCAGGGTGATAACCAGACATTCATGGAA 
AGGCATGCAGTTTGTCCATTGTGACAGTTTGTTTAATAAAACCACATACACACTTTATTT 
AAGATTAAAATCTAACTGGAAAGTCAGCTTGGAAAATGGACATTTCCAAGTATGTTTGGT 
GAGTCACAGATATAAAAATAGAAATTCTGATGAGAGGTTTCAGTTTTTAATACCAAGTCC 
TTAGGAGTCTTAACATTGGCCAGCATCTGTTTATCAAATGACATAAATACGTAAACCTAT 
AAGAATTAAGTTTATTAATTAGGCAATTTATGTCTGTGATAATTCTTACGGGAGAAAGAG 
GATTTGATTGGAAAGCAGTTTGGGAAGAAAGTGCTGCTGAAATTTCCAGAATTTAATTGA 
TTGGTTACATAAACTTTTTGACTTCAAT 

SEQ ID NO: 56_AA557536_H 

AGTAAGGCCCCGCGGGCGTCCTGGCCGCCATGTGCACCGTAGTGGACCCTCGCATTGTCC 

GGAGATACCTACTCAGGCGGCAGCTCGGGCAGGGGAGAACATTCCGGGAAATCACGCTCC 

TCCAGGTGAGTGGCCTGGGCCCTCCAGTCCAATCCCCTTGCCCAGGTACAGATCTCTCCA 

GACAGGAGAGAAACTGGCCTTCTTGGGCCCCAGAGCACAGCCCCTCCTGGCCTTCCAGCC 

GCCTCCGACTCTCTCCCCAGGAGTTTGGGGACCATCCCAACATCATCAGCCTCCTTGACG 

TGATCCGGGCAGAGAACGACAGGGACATTTACCTGGTGTTTGAGTTTATGGACACTGACC 

TGAACGCAGTCATCCGGAAGGGCGGCCTGCTGCAGGACGTCCACGTGCGCTCCATCTTCT 

ACCAGCTCCTGCGGGCCACCCGGTTCCTCCACTCGGGGCACGTTGTGCACCGGGACCAGA 

AGCCGTCCAATGTGCTCCTGGATGCCAACTGCACAGTGAAGCTGTGTGACTTTGGCCTGG 

CCCGCTCCCTGGGCGACCTCCCTGAGGGGCCTGAGGACCAGGCCGTGACAGAGTACGTGG 

CCACACGCTGGTACCGAGCACCGGAGGTGCTGCTCTCTTCGCACCGCTACACCGCTTCCT 

GCCCCAGATACACCCTTGGGGTGGACATGTGGAGTCTGGGCTGTATCCTGGGGGAGATGC 

TGCGGGGGAGACCCCTGTTCCCCGGCACGTCCACCCTCCACCAGCTGGAGCTGATCCTGG 

AGACCATCCCACCGCCATCTGAGGAGXXXAGGCCACGACAGACGCTGGATGCCCTCCTAC 

CGCCAGACACCTCCCCAGAGGCCTTGGACCTCCTTAGGCGACTCCTGGTGTTCGCCCCGG 

ACAAGCGGTTAAGCGCGACCCAGGCACTGCAGCACCCCTACGTGCAGAGGTTCCACTGCC 

CCAGCGACGAGTGGGCACGAGAGGCAGATGTGCGGCCCCGGGCACACGAAGGGGTCCAGC 

TCTCTGTGCCTGAGTACCGCAGCCGCGTCTATCAGATGATCCTGGAGTGTGGAGGCAGCA 

GCGGCACCTCGAGAGAGAAGGGCCCGGAGGGTGTCTCCCCAAGCCAGGCACACCTGCACA 

AACCCAGAGCCGACCCTCAGCTGCCTTCTAGGACACCTGTGCAGGGTCCCAGACCCAGGC 

CCCAGAGCAGCCCAGGCCATGACCCTGCCGAGCACGAGTCCCCCCGTGCAGCCAAGAACG 

TTCCCAGGCAGAACTCCGCTCCCCTGCTCCAAACTGCTCTCCTAGGGAATGGGGAAAGGC 

CCCCTGGGGCGAAGGAAGCGCCCCCCTTGACACTCTCGCTGGTGAAGCCAAGCGGGAGGG 

GAGCTGCGCCCTCCCTGACCTCCCAGGCTGCGGCTCAGGTGGCCAACCAGGCCCTGATCC 

GGGGTGACTGGAACCGGGGCGGTGGGGTGAGGGTGGCCAGCGTACAACAGGTCCCTCCCC 

GGCTTCCTCCGGAGGCCCGGCCCGGCCGGAGGATGTTCAGCACCTCTGCCTTGCAGGGTG 

CCCAGGGGGGTGCCAGGGCTTTGCTTGGAGGCTACTCCCAAGCCTACGGGACTGTCTGCC 

ACTCGGCACTGGGCCACCTGCCCCTGCTGGAGGGGCACCATGTGTGAGCCGCCCTACTCC 

CTTCACCTGGCCCTCTGTTCCTGCCCCAGCNCCTTCCCCAGACCCCTCTCCAGTCTCCTG 

CACCCCTTAGCCCTCCCTGCTTTGCCTGGCCCGTTGAAGTTCCAGGGAGCTTGCCCGGGT 

CTCCTCGGGGGAGCAGATGAGGGCCCTGCCC 

SEQ ID NO: 57_N2 8606_H, MOK_H 

ATGAAGAACTATAAAGCAATTGGCAAAATAGGAGAGGGAACGTTTTCTGAAGTTATGAAG 
ATGCAAAGCCTGAGAGATGGAAACTACTATGCATGTAAACAAATGAAGCAGCGCTTTGAA 
AGTATTGAGCAAGTCAACAACCTACGAGAGATCCAAGCACTGAGGCGCCTGAATCCGCAC 
CCAAACATTCTTATGTTGCATGAAGTGGTTTTTGACAGAAAATCTGGTTCTCTTGCACTA 
ATATGTGAACTTATGGACATGAATATTTATGAGCTAATACGAGGGAGAAGATACCCATTA 
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TCAGAAAAAAAAATTATGCACTATATGTACCAGTTATGTAAGTCCCTGGATCATATTCAC 
AGAAATGGAATATTTCACAGAGATGTAAAACCAGAAAATATACTAATAAAGCAGGATGTC 
CTGAAATTAGGGGACTTTGGCTCCTGCCGGAGTGTCTATTCCAAGCAGCCGTACACGGAA 
TACATCTCCACCCGCTGGTACCGGGCCCCGGAGTGTCTCCTCACTGATGGGTTCTACACG 
TACAAGATGGACCTGTGGAGCGCCGGCTGTGTGTTCTACGAGATCGCCAGTCTGCAGCCC 
CTCTTTCCTGGAGTAAATGAACTGGACCAAATCTCAAAAATCCACGATGTCATCGGCACA 
CCCGCTCAGAAGATCCTCACCAAGTTCAAACAGTCGAGAGCTATGAATTTTGATTTTCCT 
TTTAAAAAGGGATCAGGAATACCTCTACTAACAACCAATTTGTCCCCACAATGCCTCTCC 
CTCCTGCACGCAATGGTGGCCTATGATCCCGATGAGAGAATCGCCGCCCACCAGGCCCTG 
CAGCACCCCTACTTCCAAGAACAGAGGAAAACAGAGAAGCGGGCTCTGGGCAGCCACAGA 
AAAGCTGGCTTTCCGGAGCACCCTGTGGCACCGGAACCACTCAGTAACAGCTGCCAGATT 
TCCAAGGAGGGCAGAAAGCAGAAACAGTCCCTAAAGCAAGAGGAGGACCGTCCCAAGAGA 
CGAGGACCGGCCTATGTCATGGAACTGCCCAAACTAAAGCTTTCGGGAGTGGTCAGACTG 
TCGTCTTACTCCAGCCCCACGCTGCAGTCCGTGCTTGGATCTGGAACAAATGGAAGAGTG 
CCGGTGCTGAGACCCTTGAAGTGCATCCCTGCGAGCAAGAAGACAGATCCGCAGAAGGAC 
CTTAAGCCTGCCCCGCAGCAGTGTCGCCTGCCCACCATAGTGCGGAAAGGCGGAAGATAA 

SEQ ID NO: 58_AB023 153_H , ICK_H 

ATGAATAGATACACAACAATCAGGCAGCTCGGGGATGGAACCTACGGTTCCGTCCTGCTG 
GGAAGAAGCATTGAGTCTGGGGAGCTGATCGCTATTAAAAAAATGAAAAGAAAATTTTAT 
TCCTGGGAGGAATGCATGAACCAACGGGAGGTTAAGTCTTTAAAGAAGCTCAACCATGCC 
AATGTAGTCAAATTAAAAGAAGTTATCAGGGAAAATGATCATCTTTATTTTATCTTCGAG 
TACATGAAGGAAAATCTTTACCAGCTCATTAAAGAGAGAAATAAGTTGTTTCCTGAGTCT 
GCTATAAGGAATATCATGTATCAGATATTACAAGGACTCGCATTTATTCACAAACTCGGC 
TTCTTTCATCGAGACTTAAAGCCTGAGAACCTCCTCTGCATGGGACCAGAACTTGTGAAA 
ATTGCAGACTTTGGTTTGGCCCGAGAAATACGATCAAAACCTCCATATACAGATTATGTA 
TCTACCAGATGGTACAGGGCTCCAGAAGTACTCCTGAGGTCTACCAACTACAGCTCCCCC 
ATTGACGTCTGGGCGGTGGGCTGCATCATGGCAGAAGTTTACACCCTCAGGCCACTCTTC 
CCTGGAGCCAGTGAAATTGACACAATATTCAAAATTTGCCAAGTGCTGGGGACACCAAAA 
AAGACTGACTGGCCTGAAGGCTATCAACTTTCAAGTGCAATGAACTTCCGTTGGCCACAG 
TGTGTACCCAATAACTTAAAGACCTTGATTCCCAATGCTAGCAGTGAAGCAGTCGAGCTC 
CTGAGAGACATGCTTCAGTGGGATCCCAAGAAACGACCAACAGCTAGTCAGGCACTTCGA 
TATCCTTACTTCCAAGTTGGACACCCACTAGGCAGCACCACACAAAAGCTTCAGGATTCA 
GAAAAACCACAGAAAGGCATCCTGGAAAGGGCAGGCCCACCTCCTTATATTAAGCCAGTC 
CCACCTGCCCAGCCACCAGCCAAGCCACACACACGAATTTCTTCACGACAGCATCAAGCC 
AGCCAGCCCCCTCTGCATCTCACGTACCCCTACAAAGCAGAGGTCTCCAGGACAGATCAC 
CCAAGCCATCTCCAGGAGGACAAGCGAAGCCCGTTGCTTTTCCCATCCCTCCACAACAAG 
CATCCACAGTCGAAAATCACAGCTGGCCTGGAGCACAAAAATGGTGAGATAAAGCCAAAG 
AGTAGGAGAAGGTGGGGTCTTATTTCCAGGTCAACAAAGGATTCAGATGATTGGGCTGAC 
TTGGATGACTTGGATTTCAGTCCATCCCTCAGCAGGATTGACCTGAAAAACAAGAAAAGA 
CAGAGTGATGACACTCTCTGCAGGTTTGAGAGTGTTTTGGACCTGAAGCCCTCTGAGCCT 
GTGGGCACAGGAAACAGTGCCCCCACCCAGACGTCATATCAGCGGCGAGACACGCCCACC 
CTGAGATCTGCAGCCAAGCAGCACTATTTGAAGCACTCTCGATACTTGCCTGGGATCAGT 
ATAAGAAATGGCATACTCTCGAATCCAGGCAAGGAATTTATTCCACCTAATCCATGGTCT 
AGTTCTGGCTTGTCTGGAAAATCTTCAGGGACAATGTCAGTAATCAGCAAAGTAAATTCA 
GTTGGTTCCAGCTCTACAAGTTCTAGTGGACTGACTGGAAACTATGTCCCTTCCTTTCTG 
AAAAAAGAAATCGGTTCTGCTATGCAGAGGGTACACCTAGCACCTATTCCAGACCCTTCC 
CCTGGTTATTCCTCCCTGAAGGCCATGAGACCTCATCCTGGGCGACCATTCTTGGACACC 
CAGCCTAGAAGCACTCCTGGGTTGATACCACGGCCTCCAGCCGCCCAGCCAGTGCATGGC 
CGGACAGACTGGGCTTCCAAGTACCCATCCCGGCGGTGA 
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SEQ ID NO: 5 9_AA8 3 994 0_M 

AGCAGCAACAATGGTGGCATGAGTGCAGAGGAGGAGATAGGGCCTGGGGCTGAGCCTATG 
AGAGGACCAAGCTTGGCTACAAGGGACTGGAGAGATGAGACTGTTGGGACCACAGACCTG 
CAGCAAGGCATAGACCCAGGAGCAGTGAGCCCTGAGCCTGGGAAGGACCACGCAGCCCAG 
GGCCCAGGAAGAACTGAAGCTGGAAGGGTATCTTCTGCTGCAGAGGCTGCCATTGTGGTT 
CTAGATGACAGCGCAGCACCCCCAGCCCCTTTTGAACACCGGGTAGTGAGCATCAAAGAT 
ACCCTGATCTCAGCAGGCTACACGGTATCCCAACATGAAGTCTTAGGAGGGGGTCGGTTT 
GGCCAGGTGCACAGGTGTACAGAGAGGTCTACAGGCCTTGCACTGGCAGCCAAGATCATC 
AAAGTGAAGAACGTAAAGGACCGGGAGGATGTGAAGAATGAGGTCAACATCATGAACCAG 
CTCAGCCACGTAAACTTGATCCAACTTTATGATGCGTTTGAGAGCAAGAACAGCTTCACT 
CTGATCATGGAGTATGTGGATGGAGGCGAACTCTTTGACCGGATCACGGATGAGAAGTAC 
CACCTCACTGAGTTGGATGTGGTCTTGTTCACGAGGCAGATCTGTGAGGGTGTGCATTAC 
CTGCATCAGCACTATATCCTGCACCTGGACCTCAAGCCTGAGAACATATTGTGTGTCAGC 
CAGACAGGGCATCAAATTAAGATCATTGACTTTGGGCTGGCTAGAAGATACAAGCCTCGG 
GAGAAGCTAAAGGTGAACTTTGGTACTCCGGAGTTCCTGGCCCCAGAAGTTGTTAACTAT 
GAGTTTGTGTCATTTCCAACAGACATGTGGAGTGTGGGAGTTATCACCTACATGCTACTC 
AGTGGTTTGTCCCCATTTCTAGGGGAGACAGATGCAGAGACCATGAATTTTATTGTGAAC 
TGCAGCTGGGATTTCGATGCTGATACCTTCAAAGGGCTGTCGGAGGAAGCCAAGGACTTT 
GTTTCCCGGTTACTGGTCAAAGAGAAGAGCTGTAGGATGAGCGCCACACAGTGCCTGAAA 
CACGAGTGGTTAAATCACCTGCCTGCCAAAGCCTCGGGCTCCAACGTTCGCCTCAGATCC 
CAACAACTGCTGCAGAAATATATGGCTCAGAGTAAATGGAAGAAACATTTCCACGTGGTG 
GCTGCAGTCAACAGGCTACGGAAATTTCCAACGTGTCCCTAATCTTCAACTCTGGTGTTC 
CACTGGGCCTGGGAATTCTTGAGGCAACACGT^AGTGGTAATATGAAGAGATTACTCAAGA 
TTTTATGTAGATTGGCGCTTTGCTATTATTGATTTTTCTTATTTTGCAAAGAATGATGGA 
AGGAAGCAAGAAAGAAAGAAAAGAAGAAAAGGGGGAAGAAAAGGAAAAGGCAGAAAGCAA 
GGAAACAGGCTACGTTGTTGCTCTTCTTGTAGGTGAAAGTGTTTTTATTAAAAGCCCTAG 
GAATGTTTTTCTGCCTCGTAAGGTCAGCAGGTCTCATATGCTGCTTGCTACCCCGCACCC 
TTCCTTTTGGTAATAAGAGCAGGCACGCTCAGGATGGGCAGGGAAATCCTACTTGGCTTT 
TGGTCAAATTTGAATTCTAAACTTGTCATGATTAAAGAAGCCAGTAGGGAGGGAGGTATG 
GAAGAGGGAGGAATTAGGTCCAACAGTGGGGGATGAATTTGACCGAAACATTGTATAAAA 
TTCTTAAAGAATTAATAAAATATATTTTTAAAGGAG 

SEQ ID NO: 6 0_AA4 6 0 1 3 2_H 

GGAACCTCAGGCTTCAGAGAGCCGAAAAGTTGGGAGGCGTAACCACTTACAGGCCGGAAG 
TGTCCGGGGTGGACGCATTCGGGTAGCCGAAGAAGTCCCAGGATTGCCGAAGAAGTCCCA 
GGATTTCCGAAGCGAGCCGAAGCATCGCGACAGTTTTCAGAGACAGCTGATCGGTTGGAG 
CTGTTGCGCCGAGCAGTCATGGCGGCGGCCAGAGCTACTACGCCGGCCGATGGCGAGGAG 
CCCGCCCCGGAGGCTGAGGCTCTGGCCGCAGCCCGGGAGCGGAGCAGCCGCTTCTTGAGC 
GGCCTGGAGCTGGTGAAGCAGGGTGCCGAGGCGCGCGTGTTCCGTGGCCGCTTCCAGGGC 
CGCGCGGCGGTGATCAAGCACCGCTTCCCCAAGGGCTACCGGCACCCGGCGCTGGAGGCG 
CGGCTTGGCAGACGGCGGACGGTGCAGGAGGCCCGGGCGCTCCTCCGCTGTCGCCGCGCT 
GGAATATCTGCCCCAGTTGTCTTTTTTGTGGACTATGCTTCCAACTGCTTATATATGG^ 
GAAATTGAAGGCTCAGTGACTGTTCGAGATTATATTCAGTCCACTATGGAGACTGAAAAA 
ACTCCCCAGGGTCTCTCCAACTTAGCCAAGACAATTGGGCAGGTTTTGGCTCGAATGCAC 
GATGAAGACCTCATTCATGGTGATCTCACCACCTCCAACATGCTCCTGAAACCCCCCCTG 
GAACAGCTGAACATTGTGCTCATAGACTTTGGGCTGAGTTTCATTTCAGCACTTCCAGAG 
GATAAGGGAGTAGACCTCTATGTCCTGGAGAAGGCCTTCCTCAGTACCCATCCCAACACT 
GAAACTGTGTTTGAAGCCTTTCTGAAGAGCTACTCCACCTCCTCCAAAAAGGCCAGGCCA 
GTGCTAAAAAAATTAGATGAAGTGCGCCTGAGAGGAAGAAAGAGGTCCATGGTTGGGTAG 
AAGAATGTGTATGACAACCACACACAGTGAAGCTCTTTTTTCAAAGTAAATTTGAAGAAA 
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TGCTACAAGTATGAGATGAGATCTAAGTAAAGGTGTTAAGATATTTTTAAGTGGTATGTG 
ATCGTGTCATTATCATCTGCACTTCACTCAAGAGCTTACTATGTGTCTAAGTCATGTTCT 
AGGCAGT^ATTGGGTATTTAAAGTAAATTGAGGACAGGCTTCTCCCAGATTGTGACATGTA 
TATCTCAGATACATGGGTGTGGCATTGAACCACATAATGAGAACATTATTCTCTTTTTAG 
TCCTTGTGAGACAAGGATGAAGTCTCAGTTGCTGATACTCGCTGAGCTTACTGGCCCTCT 
AACCCAGTGTTTTTTTTTGTTGTTGTTGTGTACATGTTATATTTATTTTGAAACCAGTTT 
AATGGGATACAACCAGCATTTTAAAAAATGAAATAGAATACAGCATGG 

SEQ ID NO: 61_SGK034_H 

CAGAGAGAGAAGGTAAACCAAGGGAACATGCCAGGGCTTCAGAGCACCTTCCTAGCCATG 
GACACGGAGGAGGGGGTAGAGGTGGTGTGGAACGAGCTCCACTTCGGAGACAGGAAGGCC 
TTCGCGGCGCACGAGGAGAAGATCCAGACCGTGTTCGAGCAGCTGGTGCTGGTGGACCAC 
CCGAACATCGTGAAGTTGCACAAGTACTGGCTGGATACCTCTGAGGCCTGCGCGAGGGTC 
ATCTTCATCACAGAGTACGTGTCATCAGGCAGCCTCAAGCAATTCCTCAAAAAGACCAAG 
AAGAACCACAAGGCCATGAACGCCCGGGCCTGGAAGCGCTGGTGCACGCAGATCCTGTCT 
GCGCTCAGCTTCCTGCACGCCTGCAGCCCCCCAATCATCCACGGGAACCTGACCAGCGAC 
ACCATCTTCATTCAGCACAACGGCCTCATCAAGATCGGCTCCGTGTGGCACCGAATCTTC 
TCCAATGCACTTCCAGATGATCTCCGAAGCCCCATCCGCGCTGAGCGAGAGGAACTTCGG 
AACCTGCACTTCTTCCCCCCAGAGTATGGAGAGGTGGCCGATGGGACCGCTGTGGACATC 
TTCTCCTTTGGGATGTGTGCGCTGGAGATGGCTGTACTGGAAATCCAGACCAATGGGGAC 
ACCCGGGTCACAGAGGAGGCCATTGCTCGCGCCAGGCACTCGCTGAGTGACCCCAACATG 
CGGGAGTTCATCCTTTGCTGCCTGGCCCGGGACCCTGCCCGCCGGCCCTCTGCCCACAGC 
CTCCTCTTCCACCGCGTGCTCTTCGAGGTGCACTCGCTGAAGCTCCTGGCAGCCCACTGC 
TTCATCCAGCACCAGTACCTCATGCCTGAGAATGTGGTGGAGGAGAAGACCAAGGCCATG 
GACCTGCACGCGGTCTTGGCGGAGCTTCCCCGGCCCCGCAGGCCCCCGCTGCAGTGGCGG 
TACTCGGAAGTCTCCTTCATGGAGCTGGACAAATTCCTGGAGGATGTCAGGAATGGAATC 
TACCCACTGATGAACTTTGCAGCCACTCGACCCCTGGGGCTGCCCCGTGTGCTGGCCCCA 
CCCCCGGAGGAGGTCCAAAAGGCCAAGACCCCGACGCCAGAGCCCTTTGACTCTGAGACC 
AGAAAGGTCATCCAGATGCAGTGCAACCTGGAGAGAAGCGAGGACAAGGCGCGCTGGCAT 
CTCACTCTGCTTCTGGTGCTGGAAGACCGGCTGCACCGGCAGCTGACCTACGACCTGCTC 
CCAACGGACAGCGCCCAGGACCTCGCCTCGGAGCTCGTGCACTATGGCTTCCTCCACGAG 
GACGACCGGATGAAGCTGGCCGCCTTCCTGGAGAGCACCTTCCTCAAGTACCGTGGGACC 
CAGGCCTGACCCGGAGCCCCAGCCCCAGGGGACCATGCCGGGGTGCTGCCCGGGCAGGCC 
ATGTTGGGGAGACTCCAGCACCGTGGGGCTGCCCTCCTCCATGCGCCTGGGAGCACAAAG 
GCCCCGGTAGTGAAGGAACCCCCCGTCTCCTGAGAGTGGGGCTGACCCTGCCTTGGGCGC 
CGAGGGGTTGGGGGGTGGGTGTGGGGGAGCCGTTAGGCCTCCCAGGTCCTTAGGATCAGG 
GTTGCCCCCAGAACCCCTTCCCATATCCTCCATTCTCCGCCCTGAGTTCCTACCCAGGCT 
GCCTGGCTGGGGCCACTGCCTCCTCAGCATGCAGGAGGCTGCCCTGTAGGGAACCCCAGC 
TCTGGGGCTTGGGGGTGAGGGTCAGCCCTGGACAGACCTCTGCCCAGGGAACTGCTCCAT 
GGGGTCTGGGAGAGCAGCCATCCCCTGCTGGCACCATAGACCCACACAAGGAGCCTGCAC 
AGCAAGCCAGCGGTGACACACCTGCAGGTGTCAGGCATGGCACTGGGCACAACAGGGACC 
TGGCAGGAGAAACAGACCACAGAGAGGTCTGGAGTTGAGGCTGTTGTCAGCAAAGCGCCT 
GGTCCCACACAGCTCTGCCCTAGAGCCAGCTGTTTGACCCTTTACCCACCCTGAGACCAG 
AACTTGCAGCCCCTCTGCAGATCTCCTCTGGCCACTGCAGCCCCTCCAATGGGCTTTTTG 
TCTCATGCATTCCCTGGCCTGGAGGCGTCAGGGACCCCACATCCTCCCTGCTCCTCAGAC 
TCACAGCCCCTCCATGTTACCTCCCGCACCTCCTCCCTGGGGGAGCTGCTCCCTGGGCCT 
CTGAGGATGTCAGCTCCTGGCTCCCTGCCTCTCTCGCACTGCAGTCCTGGCTCAGTCTTA 
GAGATTTCTATGCCCTCATGGATTCTACCGCTGGCTTCCTGGCCTCTTGATTCTTGGCTT 
GCCTCTCCTCCAATTCCAAACTTAGTGAAATGGCCTTAAGCATTTTAAACTGTATGTATA 
CATTAGCGCATTCATGCCTTTCTAAACGCATTTCAAATGTCAACCAGGAAGGCAGACCAC 
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TGTATTAGTTTTATACTGCCGCTGTAAAATTTACCACAAACTTAGTGACTTAACACAAAT 
TTATTGCAATTCTGTAGGCTGGAAGTCTGACTATGGGTCTCACTGGACTAGAATCAAGGC 
TGGCAGGCTGCCTTCCTTCCTGGAGGTTCTAGGGGAGACTCTGTCTCCTGCTCCTTCAGG 
CTGCTGGCAGAATCCACATCCTTTCGGTGGCAGGGCCAAGGTCCCCACTTTCTTGCTGAC 
TGTAAACTAAGGCCACTTCCAGCTTGTAGAGGCTGCCTACATTCCTTGGCTCTTGGCCCC 
CTCCTCCATCTTCAGAGCTAGCAGGTTCAGTCTGTGTCACGAACCATTTCTCTGGTTCCC 
TGCAGACAGGAAAGGTTGTCCCTAAGGACTCATGAGATTAGGTTGGGCCCAGCCAGATAA 
TACATGATAATCTCCCTCCTCAAGGTTTTTAATATTAAACACATCTGCAGGACACATTTT 
GCCATGTAAACTAACATTCACTGGTTCCAGGGATTAAGGAATGAACCTCTTTTGTTGGGG 
AAGGGTGGCATTCTGCTGACCACAGCACTCCAACCAAAAGCCAAAAACCAAAGCAAGACT 
TACTAACGCATATCAAATAAATTAAAGGTACAAAATCGTGAATCTCAGTTATCTTAAATA 
TTCCAATACTATTTACAAAATTATTCAAATTCTCACGCCTTCCAACTCAAAATTAGCAAT 
CTAAAGTAATTTCCATATCCTAGATGGAAACCCTCATGCTAAACTGTCTGATTATGCATG 
GTTCTAAATGGTTTCAGTGGCAAATACATAACATTGTACTACTGATTAAACTGAACTTAA 

AAGC 

SEQ ID NO: 62_AA103218_M SGK034_M 

CCACGCGTCCGCACCAGAGTATGGCGAAGTCAATGATGGGACTGGCTTTGTGGACATCTT 

CTCCTTCGGGATGTGTGCACTGGAGATGGCTGTACTCGAGATCCAAGCCAACGGGGATAC 

CAGAGTCACAGAAGAGGCCATCGCTCGAGCCAGGCACTCACTGAGTGACCCCAACATGCG 

GGAATTCATCCTCTCCTGCCTGGCCCGGGACCCTGCCCGCCGACCCTCAGCCCACAACCT 

CCTCTTCCACCGAGTGCTCTTTGAGGTGCACTCGCTGAAGCTGCTGGCAGCTCACTGCTT 

CATCCAGCACCAGTACCTCATGCCTGAGAATGTGGTAGAGGAAAAGACCAAGGCCATGGA 

CCTCCATGCAGTTTTGGCTGAGATGCCGCAGCCCCATGGACCCCCAATGCAGTGGCGGTA 

CTCAGAGGTCTCCTTCTTGGAGCTGGACAAATTCCTAGAGGATGTCAGGAACGGGATCTA 

TCCACTGATGAACTTTGCGGCTGCTCGGCCCTTGGGGCTTCCCCGTGTGTTGGCCCCACC 

CCCAGAGGAAGCCCAAAAGGCCAAAACTCCAACGCCAGAACCCTTTGACTCGGAGACCAG 

GAAGGTGGTCCAGATGCAGTGCAACCTGGAAAGAAGCGAGGACAAGGCTCGGTGGCACCT 

TACTCTGCTCTTGGTGCTTGAGGACCGGCTACATCGGCAGCTGACCTATGATCTGCTCCC 

AACGGACAGTGCCCAGGACCTCGCTGCTGAACTAGTGCATTATGGCTTCCTGCACGAGGA 

TGACAGGACAAAGCTAGCAGCCTTTCTGGAGACCACTTTTCTCAAGTACCGAGGGACGCA 

AGCGTGACCTTCCCAGTCCTGACGGCCCAGCAGAGATACAGGGGCTCAGGGTTGTCCACT 

TGGCAAAGAGCCCCCACACTGCTCAAAGCTGCCTTCTGCCTGTGTTCCCTGGAACTGAAC 

ACAGGCCCTGCTAGTGAAGACACCCCCACCCCCCAGCTTTCTGCAGCAGTGTGGGACCCT 

GGGGTGGTGATGGAGCCCTGAGCCTGGACGAGAGTGGATACAGGTCAGTTAGGGGAACCG 

CTCCATCTGGTACTAGACAACAGCCATGCCTTCAGGTGGCATAGAAACCTAGGGAAGGAG 

CCTGAACTCAGGTGTCACAGTGCTGGGCATCAGGCAGACCAGACCTGACCTGATTGGAGA 

ACTGTAGACTAGATAGCTTGGAGTTGAACCCATGGCCAGGGAATTCCTTGGTCCTGCTCA 

GACCAGTCCTGATCCCTTGCAGACCTGCCTTGAGCCCTCTTTCTGATCTTCCACACTCTT 

GAGACCAGGACCTGTGTCCTCCCCAAAGCCCTTGGGAAGGATCTTTCTATTCATCATCCC 

TCTGGCCTAGGGGCTCAGGGGTCAGGCATCCTCCACATTCCCTCCCTGGGGAAGTTGTGT 

GTTTGAGTTGAGGATGTGGGTTCCTGGCTCCCTCTTTCTCCCCAGCCCAACTTGTCTCTT 

TCTTACTGGTTTCAAAGTCCTGATGAACGCTTCCCCTCAGAGCCACCCTGGTTTCCTTGG 

TTCTTGAACTGCCTCTCTCCCAACTTCAAACCAGGTCTTAAACGTTTTTTAAATGCATAT 

ATAAATGTAATGCAGTCACGGTCCTTTTTAAACACTTTGTGTATGAAACCAGGAAAGCTC 

ACTATTGTATTAGGAATAGTTCCACATTGCTGCTGTTAACAGATATCATAAACCCAGTGG 

TTTGAGACGACACACACACACACACACACACACACACACAGAGAGAGAGAGAGTTCTGTA 

CATCAAGTGTGATCCAGGCTCTCACTAGATTAATACCCAGGCTAAGTTCCTTTCTGGAAG 

CTGGGACTTACCTCCTGCTCCTTCAAGCTATTGGCAGAACTCACTTCCCTGCAATGGTAA 

GGCAGAAATCCCTATTTTCTCAACAGCTGCCAACTAAGAACCCCTCTCAGCTTCTAGAGG 



WO 00/73469 



PCT/US00/14842 



FIGURE 2 XX 

CCACCAACTTTTCTTAGTTCTTCTTTCTCCCCCCTCAAGACCAGCAGCGTCAAGTTGAAT 
CTTTGTCCTGGGCTAGCTGACTGGCTTGCCACTGCTGGGAAGAGTTGGGGCCTTTTGTGA 
GTAGGTTGGACCCACCAGGATAACCGAGGATGATCCCCTTCTCAGGGTCTATAGATGAAC 
CACACCTGCGCAGTTCCTTCTGCTGTCATCCTGGGCTTTGGTGCTTGGAGAACAGCCGTG 
GGCGGTGGGTGTTGTTACTGTGGTACCTACCATGCCATCTTAACCGAAACCAAGACCTAA 
AATAAAACAGATTTGTCATGGGACATCTAATAAATTAAATGAACTCTG 

SEQ ID NO: 63_NEK7_H. N34132_H 

CACGAATCCGAGCCCGCTCGCCTCTCTCCAGCGAACCGACCATGTCTGGCGGCGCCGCAG 
AGAAGCAGAGCAGCACTCCCGGTTCCCTGTTCCTCTCGCCGCCGGCTCCTGCCCCCAAGA 
ACGGCTCCAGCTCCGATTCCTCCGTGGGGGAGAAACTGGGAGCCGCGGCCGCCGACGCTG 
TGACCGGCAGGACCGAGGAGTACAGGCGCCGCCGCCACACTATGGACAAGGACAGCCGTG 
GGGCGGCCGCGACCACTACCACCACTGAGCACCGCTTCTTCCGCCGGAGCGTCATCTGCG 
ACTCCAATGCCACTGCACTGGAGCTTCCCGGCCTTCCTCTTTCCCTGCCCCAGCCCAGCA 
TCCCCGCGGCTGTCCCGCAGAGTGCTCCACCGGAGCCCCACCGGGAAGAGACCGTGACCG 
CCACCGCCACTTCCCAGGTAGCCCAGCAGCCTCCAGCCGCTGCCGCCCCTGGGGAACAGG 
CCGTCGCGGGCCCTGCCCCCTCGACTGTCCCCAGCAGTACCAGCAAAGACCGCCCAGTGT 
CCCAGCCTAGCCTTGTGGGGAGCAAAGAGGAGCCGCCGCCGGCGAGAAGTGGCAGCGGCG 
GCGGCAGCGCCAAGGAGCCACAGGAGGAACGGAGCCAGCAGCAGGATGATATCGAAGAGC 
TGGAGACCAAGGCCGTGGGAATGTCTAACGATGGCCGCTTTCTCAAGTTTGACATCGAAA 
TCGGCAGAGGCTCCTTTAAGACGGTCTACAAAGGTCTGGACACTGAAACCACCGTGGAAG 
TCGCCTGGTGTGAACTGCAGGATCGAAAATTAACAAAGTCTGAGAGGCAGAGATTTAAAG 
AAGAAGCTGAAATGTTAAAAGGTCTTCAGCATCCCAATATTGTTAGATTTTATGATTCCT 

GGGAAT CCACAGTAAAAGGAAAGAAGTG CATTGTTTTGGTGAC TG AACTTATGACGTC TG 

GAACACTTAAAACGTATCTGAAAAGGTTTAAAGTGATGAAGATCAAAGTTCTAAGAAGCT 

GGTGCCGTCAGATCCTTAAAGGTCTTCAGTTTCTTCATACTCGAACTCCACTTATCATTC 

ACCGCGATCTTAAATGTGACAACATCTTTATCACCGGCCCTACTGGCTCAGTCAAGATTG 

GAGACCTCGGTCTGGCAACCCTGAAGCGGGCTTCTTTTGCCAAGAGTGTGATAGGTACCC 

CAGAGTTCATGGCCCCTGAGATGTATGAGGAGAAATATGATGAATCCGTTGACGTTTATG 

CTTTTGGGATGTGCATGCTTGAGATGGCTACATCTGAATATCCTTACTCGGAGTGCCAAA 

ATGCTGCGCAGATCTACCGTCGCGTGACCAGTGGGGTGAAGCCAGCCAGTTTTGACAAAG 

TAGCAATTCCTGAAGTGAAGGAAATTATTGAAGGATGCATACGACAAAACAAAGATGAAA 

GATATTCCATCAAAGACCTTTTGAACCATGCCTTCTTCCAAGAGGAAACAGGAGTACGGG 

TAGAATTAGCAGAAGAAGATGATGGAGAAAAAATAGCCATAAAATTATGGCTACGTATTG 

AAGATATTAAGAAATTAAAGGGAAAATACAAAGATAATGAAGCTATTGAGTTTTGTTTTG 

ATTTAGAGAGAGATGTCCCAGAAGATGTTGCACAAGAAATGGTAGAGTCTGGGTATGTCT 

GTGAAGGTGATCACAAGACCATGGCTAAAGCTATCAAAGACAGAGTATCATTAATTAAGA 

GGAAACGAGAGCAGCGGCAGTTGGTACGGGAGGAGCAAGAAAACAAAAAGCAGGAAGAGA 

GCAGTCTCAAACAGCAGGTAGAACAATCCAGTGCTTCCCAGACAGGAATCAAGCAGCTCC 

CTTCTGCTAGCACCGGCATACCTACTGCTTCTACCACTTCAGCTTCAGTTTCTACACAAG 

TAGAACCTGAAGAACCTGAGGCAGATCAACATCAACAACTACAGTACCAGCAACCCAGTA 

TATCTGTGTTATCTGATGGGACGGTTGACAGTGGTCAGGGATCCTCTGTCTTCACAGAAT 

CTCGAGTGAGCAGCCAACAGACAGTTTCATATGGGTTCCCAANNCATGAACAGGCACATT 

CTACAGGCACAGTCCCAGGGCATATACCTTCTACTGTCCAAGCACAGTCTCAGCCCCATG 

GGGTATATCCACCCTCAAGTGTGCAGCAGGGAATACAGCAGACAGCCCCTCCTCAACAGA 

CAGTGCAGTATTCACTTTCACAGACATCAACCTCCAGTGAGGCCACTACTGCACAGCCAG 

TGAGTCAGCCTCAAGCTCCACAAGTCTTGCCTCAAGTATCAGCTGGAAAACAGAGTACTC 

AGGGAGTCTCTCAGGTTGCTCCTGCAGAGCCAGTTGCAGTAGCACAGCCCCAAGCTACCC 

AGCCGACCACTTTGGCTTCCTCTGTAGACAGTGCACATTCAGATGTTGCTTCAGGTATGA 

GTGATGGCAATGAGAACGTCCCATCTTCCAGTGGAAGGCATGAAGGAAGAACTACAAAAC 



WO 00/73469 



PCTAJS00/14842 



FIGURE 2YY 

GGCATTACCGAAAATCTGTAAGGAGTCGCTCTCGACATGAAAAAACTTCACGCCCAAAAT 

TAAGAATTTTGAATGTTTCAAATAAAGGAGACCGAGTAGTAGAATGTCAATTAGAGACTC 

ATAATAGGAAAATGGTTACATTCAAATTTGACCTAGATGGTGACAACCCCGAGGAGATAG 

CAACAATTATGGTGAACAATGACTTTATTCTAGCAATAGAGAGAGAGTCGTTTGTGGATC 

AAGTGCGAGAAATTATTGAAAAAGCTGATGAAATGCTCAGTGAGGATGTCAGTGTGGAAC 

CAGAGGGTGATCAGGGATTGGAGAGTCTACAAGGAAAGGATGACTATGGCTTTTCAGGTT 

CTCAGAAATTGGAAGGAGAGTTCAAACAACCAATTCCTGCGTCTTCCATGCCACAGCAAA 

TAGGCATTCCTACCAGTTCTTTAACTCAAGTTGTTCATTCTGCGGGAAGGCGGTTTATAG 

TGAGTCCTGTGCCAGAAAGCCGATTACGAGAATCAAAAGTTTTCCCCAGTGAAATAACAG 

ATACAGTTGCTGCCTCTACAGCTCAGAGCCCTGGAATGAACTTGTCTCACTCTGCATCAT 

CCCTTAGTCTACAACAGGCCTTTTCTGAACTTAGACGTGCCCAAATGACAGAAGGACCCA 

ATACAGCACCTCCAAACTTTAGTCATACAGGACCAACATTTCCAGTAGTACCTCCTTTCT 

TAAGTAGCATTGCTGGAGTCCCAACCACAGCAGCAGCCACAGCACCAGTCCCTGCAACAA 

GCAGCCCTCCTAATGACATTTCCACATCAGTAATTCAGTCTGAGGTTACAGTGCCCACTG 

AAGAGGGGATTGCTGGAGTTGCCACCAGCACAGGTGTGGTAACTTCAGGTGGTCTCCCCA 

TACCACCTGTGTCTGAATCACCAGTACTTTCCAGCGTAGTTTCAAGTATCACAATACCTG 

CAGTTGTCTCAATATCTACTACATCCCCGTCACTTCAAGTCCCCACATCCACATCTGAGA 

TCGTTGTTTCTAGTACAGGACTGTATCCTTCAGTAACAGTTTCAGCAACTTCAGCCTCTG 

CAGGGGGCAGTACTGCTACCCCAGGTCCTAAGCCTCCAGCTGTAGTATCTCAGCAGGCAG 

CAGGCAGCACTACTGTGGGAGCCACATTAACATCAGTTTCTACCACCACTTCATTCCCAA 

GCACAGCTTCACAGCTGTCCATTCAGCTTAGCAGCAGTACTTCTACTCCTACTTTAGCTG 

AAACCGTGGTAGTTAGCGCACACTCACTAGATAAGACATCTCATAGCAGTACAACTGGAT 

TGGCTTTCTCCCTCTCTGCACCATCTTCCTCTTCCTCTCCTGGAGCAGGAGTGTCTAGTT 

ATATTTCTCAGCCTGGTGGGCTGCATCCTTTGGTCATTCCATCAGTGATAGCTTCTACTC 

CTATTCTTCCCCAAGCAGCAGGACCTACTTCTACACCTTTATTACCCCAAGTACCTAGTA 

TCCCACCCTTGGTACAGCCTGTTGCCAATGTGCCTGCTGTACAGCAGACACTAATTCATA 

GTCAGCCTCAACCAGCTTTGCTTCCCAACCAGCCCCATACTCATTGTCCTGAAGTAGATT 

CTCATACACAACCCAAAGCTCCTGGAATTGATGACATAAAGACTCTAGAAGAAAAGCTGC 

GGTCTCTGTTCAGTGAACACAGCTCATCTGGAGCTCAGCATGCCTCTGTCTCACTGGAGA 

CCTCACTAGTCATAGAGAGCACTGTCACACCAGGCATCCCAACTACTGCTGTTGCACCAA 

GCAAACTCCTGACTTCTACCACAAGTACTTGCTTACCACCAACCAATTTACCACTAGGAA 

CAGTTGCTTTGCCAGTTACACCAGTGGTCACACCTGGGCAAGTTTCTACCCCAGTCAGCA 

CTACTACATCAGGAGTGAAACCTGGAACTGCTCCCTCCAAGCCACCTCTAACTAAGGCTC 

CGGTGCTGCCAGTGGGTACTGAACTTCCAGCAGGTACTCTACCCAGCGAGCAGCTGCCAC 

CT?TTCCAGGACCTTCTCTAACCCAGTCCCAGCAACCTCTAGAGGATCTTGATGCTCAAT 

TGAGAAGAACACTTAGTCCAGAGATGATCACAGTGACTTCTGCGGTTGGTCCTGTGTCCA 

TGGCG^CTCCAACAGCAATCACAGAAGCAGGAACACAGCCTCAGAAGGGTGTTTCTCAAG 

TCAAAGAAGGCCCTGTCCTAGCAACTAGTTCAGGAGCTGGTGTTTTTAAGATGGGACGAT 

TTCAGGTTTCTGTTGCAGCAGACGGTGCCCAGAAAGAGGGTAAAAATAAGTCAGAAGATG 

CAAAGTCTGTTCATTTTGAATCCAGCACCTCAGAGTCCTCAGTGCTATCAAGTAGTAGTC 

CAGAGAGTACCTTGGTGAAACCAGAGCCGAATGGCATAACCATCCCTGGTATCTCTTCAG 

Sg?gc2agagagtgcccacaaaactactgcctcagagg 

Saccaaggttggacgttttcaggtgacaactacagcaaacaaagtgggtcgtttctctg 
Stcaaaaactgaggacaagatcactgacacaaagaaagaaggaccagtggcatctcctc 
cttttatggatttggaacaagctgttcttcctgctgtgataccaaagaaagagaagcctg 
SSScagaLcttcacatctaaatgggccgtcttctgaccgggaggcggcttttttaa 
ctagggatgtggatgatggttccggtagtccacactcgccccatgagctgagctcaaaga 
gSttcctagccagaatctaagtcaaagccttagtaattcatttaactcctcttacatga 
gSgcgacaatgagtcagatatcgaagatgaagacttaaagttagagctgcgacgactac 
gSataaacatgtcaaagagattcaggacctgcagagtcgccagaagcatgaaattgaat 
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CTTTGTATACCAAACTGGGCAAGGTGCCCCCTGCTGTTATTATTCCCCCAGCTGCTCCCC 

TTTCAGGGAGAAGACGACGACCCACTAAAAGCAAAGGCAGCAAATCTAGTCGAAGCAGTT 

CCTTGGGGAATAAAAGCCCCCAGCTTTCAGGTAACCTGTCTGGTCAGAGTGCAGCTTCAG 

TCTTGCACCCCCAGCAGACCCTCCACCCTCCTGGCAACATCCCAGAGTCCGGGCAGAATC 

AGCTGTTACAGCCCCTTAAGCCATCTCCCTCCAGTGACAACCTCTATTCAGCCTTCACCA 

GTGATGGTGCCATTTCAGTACCAAGCCTTTCTGCTCCAGGTCAAGGTAATAAAGCAACCA 

TCATCGTCCAAAAACAATAAAATGGAGATGTTGCCATACCTGGGACAAAAGCCTGTTAAG 

GCGGGTTGGGAGACTAGCTGACCAGAACACAGCCTGTGTGTTGTACACTGAAGAATCTGG 

GTGAAAAGGGAAGTGGAGTGATAATGAGAATCGGTGGGCTCACTGCTCCCATTAGGTGAA 

ATTACTTTTTTTCAAGGAATTACAGTGAAAAGTTACATCTGTGTGGCCTATATGACTTGC 

TCATTTGGGATTTGGAACTTAGGCTTTAATATTAGGCTGAGATTTCCTGGATGAAATTCT 

AAGGTGTTTTAGCAGTTTCTGAAGCTAATACATTTTCTTAGCCATTGTAGAATTTTGTTA 

CTTTTAAGTATGGGAGTGGCATACTAAAATGAATAACCTTACAATTCAGTTTTTTATCCA 

TAATCTACTTTCCAAATATAGCTCTGTTTATTAGTGATTGCTGAAAAAATTCCCACAGAG 

GAAAGAGCTTTTAGTCATATTAGAACAAGAATTGAAAAGACTTGGGCATCTGGGTGAGAA 

GAATGAAAAAAATATAGGTACTGGCTTATGTGCCTTTGCCACAGTTTCACAGAAATTAGA 

GATCAGTCTCTTCACAGGAAGAATGCACTTGATTGGTAAGGAGGGCAAACTAGCTAGCAT 

TATTCGAACTAAGAAAAGCTTCCGCATTTTGCAGATGGGTAGAATTAAGACCTAATATTT 

CATCTCTTACATATCTGACCTTCCCCCCAGAAGCTTGTTCTTCTGTGTGCCATCTTAGTG 

CATTTCACCACTCCAGCCTCAAGTTTCTAACATCTTGTAGTTGTGTTCTGTCTCTTCTCC 

TCTCTCTGTTCTACCCTGTTTTTCCCCTCTCACAGGCTGTGCGAAGTTTAACTGTGCATC 

TGAACAGGTGACATTCAAACCTGGTGGCAGGAGGACCCGATTTCTGAGTACGCCCTGCTT 

GGCTCTTTGTGTGTAACACCTTTACTCCTTCCTTGTCCTTGTGTTTCTGCTGCTTGGATC 

TGATGTTTCACGCAGTCCATTTTCATTTGTCTCTTTTTGTATATCATCTACTCAGTGGCT 

TGGCTGAATTACTGTTACCCTCAGAAGTTTGGGCCCCCACATTAATTATGATAAAAAATG 

TCAAAATAACAAGTTATCTACAAATTTCAATGTAACTTTCTGGTAGAAGTGCTTCTTCAT 

GGATCTGTGACAGAGAGTGGATATGGTATCTAGGCAATAGATTGCTGGGTCATTTAGAAT 

AATGAAGACTGAACTCCACAGTCGTAGTCAGTGCTGTCTGTCTGCCCTAGCATTAGAAAT 

GAGAGAAATCAGCCAGACACGGTGGCGTACACCTGTAATCCCAGCACTTTGGGAGGCCGA 

GGCGGGAAGATTGCTTGAGGCCAGGAGCTCGAGACCAACCCTGGGCAACATGGTGATACC 

CCATCTCT 

SEQ ID NO: 64_BCON3_H 

GCGGAGCGCAGCTGTGAGGGAGTCGCTGTGATCCGGGGCCCCGGAACCCGAGCTGGAGCT 
GAAGCGCAGGCTGCGGGGCGCGGAGTCGGGAGGCCTGAGTGTTCCTTCCAGCATGTCGGA 
GGGGGAGTCCCAGACAGTACTTAGCAGTGGCTCAGACCCAAAGGTAGAATCCTCATCTTC 
AGCTCCTGGCCTGACATCAGTGTCACCTCCTGTGACCTCCACAACCTCAGCTGCTTCCCC 
AGAGGAAGAAGAAGAAAGTGAAGATGAGTCTGAGATTTTGGAAGAGTCGCCCTGTGGGCG 
CTGGCAGAAGAGGCGAGAAGAGGTGAATCAACGGAATGTACCAGGTATTGACAGTGCATA 
CCTGGCCATGGATACAGAGGAAGGTGTAGAGGTTGTGTGGAATGAGGTACAGTTCTCTGA 
ACGCAAGAACTACAAGCTGCAGGAGGAAAAGGTTCGTGCTGTGTTTGATAATCTGATTCA 
ATTGGAGCATCTTAACATTGTTAAGTTTCACAAATATTGGGCTGACATTAAAGAGAACAA 
GGCCAGGGTCATTTTTATCACAGAATACATGTCATCTGGGAGTCTGAAGCAATTTCTGAA 
GAAGACCAAAAAGAACCACAAGACGATGAATGAAAAGGCATGGAAGCGTTGGTGCACACA 
AATCCTCTCTGCCCTAAGCTACCTGCACTCCTGTGACCCCCCCATCATCCATGGGAACCT 
GACCTGTGACACCATCTTCATCCAGCACAACGGACTCATCAAGATTGGCTCTGTGGCTCC 
TGACACTATCAACAATCATGTGAAGACTTGTCGAGAAGAGCAGAAGAATCTACACTTCTT 
TGCACCAGAGTATGGAGAAGTCACTAATGTGACAACAGCAGTGGACATCTACTCCTTTGG 
CATGTGTGCACTGGAGATGGCAGTGCTGGAGATTCAGGGCAATGGAGAGTCCTCATATGT 
GCCACAGGAAGCCATCAGCAGTGCCATCCAGCTTCTAGAAGACCCATTACAGAGGGAGTT 
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CCACCCAGCATTGTTTGAAGTTCCCTOGCTC 
ACACCAACACATGATCCCAGAGAACGCTCTAG^ 

TGCCGTACTGGCTGAAATCCCTGCA^ 
TCAGTCACCAGCTCTGGAATTAGATAAATTCCTTG^ 

TCTGACAGCCTTTGGGCTGCCTCGGCCCCAGCAGC 
TGTCGTGCCCCCCTCTGTCAAGACTCCGA^ 
GGTGGTGCTGATGCAGTGCAACATTGAGTCG^^ 
ACTTCTGCTGAAGTTGGAGGACAAACTGAACCGGCACCTGAG^ 

CCAGAGCCGGTTGACTTCTCTGCTAGA^GAGACCTO 
CAGTACCCTCAACTCAGCCGCTGTCACCGTCTCCTC^TTA 

TGATCTGCGCTGTGGCTGTCCCTGGACG^ 
AGTATTACCCTGTGAAGCCCCTTCCCTCCTTTATTATTCAG^ 

TGGTTCTGAGCATCATCCTTTCCCCTCCCCTCTCTTCC^ 

TGTTTTGCACAGACGTGGGCCTGGGCCTTC 

CGCTGATCTGCCGGCTCCCGCCCAGCCTGTGTGG^ 

GCT 

SEQ ID NO: 6 5_AA7 1182 ^ ^ a r a &rrACAAGACTATGAATGAAAAGGCTTGG 

AAGAACCTACACTTTTTTGCACCAGAGTA^ 
GACATCTACTCCTTTGGCATGTGTGCACTC 

GGCGAGTCCTCATATGTC^^ 

TCATTACAGAGGGAGTTTATTCA^GTGCCTGC^ 

GCCAGAGAACTTCTGTTCCACCCAGCACTGm 

GCTCACTGTATCGTGGGGCACCAACACA^ 

AAGAACATGGATACCAGTGCTGTACTAGCTGAAA 

GTTCAGACTTTGTACTCTCAGTCACCAGCCCTAGAAT^ 

AGGAATGGGATCTACCCTCTGACAGCCTTTGGGC^^ 

GAGGAGGTGACATCACCTGTTGTGCCCCCCTC^ 

GAAGTGGAGACACGAAAGGTGGTGCTGATGCAG^ 

GCTCCCTGGTTGAGTATCACCCTGCCCCTTCCCCT^ 

TACTTGTTTTGCACAGACGTGGGCCTGGGCCT^ 

TAGTAGCTGACCTGCTGCCTCCTGCCCTACTTOT 

TTTTTGCT 
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SEQ ID NO: 66_AA099102_H 

ATGTCATCATGTGTCTCTAGCCAGCCCAGCAGCAACCGGGCCGCCCCCCAGGATGAGCTG 

GGGGGCAGGGGCAGCAGCAGCAGCGAAAGCCAGAAGCCCTGTGAGGCCCTGCGGGGCCTC 

TCATCCTTGAGCATCCACCTGGGCATGGAGTCCTTCATTGTGGTCACCGAGTGTGAGCCG 

GGCTGTGCTGTGGACCTCGGCTTGGCGCGGGACCGGCCCCTGGAGGCCGATGGCCAAGAG 

GTCCCCCTTGACACCTCCGGGTCCCAGGCCCGGCCCCACCTCTCCGGTCGCAAGCTGTCT 

CTGCAAGAGCGGTCCCAGGGTGGGCTGGCAGCCGGTGGCAGCCTGGACATGAACGGACGC 

TGCATCTGCCCGTCCCTGCCCTACTCACCCGTCAGCTCCCCGCAGTCCTCGCCTCGGCTG 

CCCCGGCGGCCGACAGTGGAGTCTCACCACGTCTCCATCACGGGTATGCAGGACTGTGTG 

CAGCTGAATCAGTATACCCTGAAGGATGAAATTGGAAAGGGCTCCTATGGTGTCGTCAAG 

TTGGCCTACAATGAAAATGACAATACCTACTATGCAATGAAGGTGCTGTCCAAAAAGAAG 

CTGATCCGGCAGGCCGCTTTTCCACGTCGCCCTCCACCCCGAGGCACCCGGCCAGCTCCT 

GGAGGCTGCATCCAGCCCAGGGGCCCCATTGAGCAGGTGTACCAGGAAATTGCCATCCTC 

AAGAAGCTGGACCACCCCAATGTGGTGAAGCTGGTGGAGGTCCTGGATGACCCCAATGAG 

GACCATCTGTACATGGTGTTCGAACTGGTCAACCAAGGGCCCGTGATGGAAGTGCCCACC 

CTCAAACCACTCTCTGAAGACCAGGCCCGTTTCTACTTCCAGGATCTGATCAAAGGCATC 

GAGTACTTACACTACCAGAAGATCATCCACCGTGACATCAAACCTTCCAACCTCCTGGTC 

GGAGAAGATGGGCACATCAAGATCGCTGACTTTGGTGTGAGCAATGAATTCAAGGGCAGT 

GACGCGCTCCTCTCCAACTACGTGGGCACGCCCGCCTTCATGGCTCCCGAGTCGCTCTCT 

GAGACCCGCAAGATCTTCTCTGGGAAGGCCAAGGATGTTTGGGCCATGGGTGTGACACTA 

TACTGCTTTGTCTTTGGCCAGTGCCCATTCATGGACGAGCGGATCATGTGTTTACACAGT 

AAGATCAAGAGTCAGGCCCTGGAATTTCCAGACCAGCCCGACATAGCTGAGGACTTGAAG 

GACCTGATCACCCGTATGCTGGACAAGAACCCCGAGTCGAGGATCGTGGTGCCGGAAATC 

AAGCTGCACCCCTGGGTCACGAGGCATGGGGCGGAGCCGTTGCCGTCGGAGGATGAGAAC 

TGCACGCTGGTCGAAGTGACTGAAGAGGAGGTCGAGAACTCAGTCAAACACATTCCCAGC 

TTGGCAACCGTGATCCTGGTGAAGACCATGATACGTAAACGCTCCTTTGGGAACCCATTC 

GAGGGCAGCCGGCGGGAGGAACGCTCACTGTCAGCGCCTGGAAACTTGCTCACCAAAAAA 

CCAACCAGGGAATGTGAGTCCCTGTCTGAGCTCAAGGAAGCAAGGCAGCGAAGACAACCT 

CCAGGGCACCGACCCGCCCCCCGTGGGGGAGGAGGAAGTGCTCTTGTGAGAGGCAGTCCC 

TGCGTGGAAAGTTGCTGGGCCCCCGCCCCCGGCTCCCCCGCACGCATGCATCCACTGCGG 

CCGGAGGAGGCCATGGAGCCCGAGTAG 
SEO ID NO: 67 5R69 17 2_H 

CCGGGATGTGAGCCTGGTGGTTGGCAGCTGGAGCCACGTCGGAGGGGGAAGTGTCGCAGC 
ATTCTCTGCAGGCATCACAGACCTGAGGCAGTGGCCTCCGGAGGGCACTGGACAGAAACA 
GCCATCCAAGTGGCTGAGTGGAGGGACCCTGCTCAAGTGCAGCTGCAGTGGCCGGGGTTT 
CCCTCAGGTAGGGATCGGGGCGCCTTGTCGCCGCCAGCCACGTGTGGCGTCCGGTACAGT 
CAGCAGAGTGCAGGGTGCGGGCACCAGGAAAGGGGGCGCAGGGGAACTCCCGCGGGCCTC 
rCGTTTGCAAACTTCTCGCCTGGGCAGGAGGCGGTCGTGGGAAAGAAGGTGGAAGAGCGA 
GCTTTTTGGAACTGTGCACGGGACAGATTGGACGCACACCCCTCGGGAGGCGCGAAGGCA 
TGGAAAATTTGAAGCATATTATCACCCTTGGCCAGGTCATCCACAAACGGTGTGAAGAGA 
TGAAATACTGCAAGAAACAGTGCCGGCGCCTGGGCCACCGCGTCCTCGGCCTGATCAAGC 
CTCTGGAGATGCTCCAGGACCAAGGAAAGAGGAGCGTGCCCTCTGAGAAGTTAACCACAG 
CCATGAACCGCTTCAAGGCTGCCCTGGAGGAGGCTAATGGGGAGATAGAAAAGTTCAGCA 
ATAGATCCAATATCTGCAGGTTTCTAACAGCAAGCCAGGACAAAATACTCTTCAAGGACG 
TGAACAGGAAGCTGAGTGATGTCTGGAAGGAGCTCTCGCTGTTACTTCAGGTTGAGCAAC 
GCATGCCTGTTTCACCCATAAGCCAAGGAGCGTCCTGGGCACAGGAAGATCAGCAGGATG 
CAGACGAAGACAGGCGAGCTTTCCAGATGCTAAGAAGAGATAATGAAAAAATAGAAGCTT 
CACTGAGACGATTAGAAATCAACATGAAAGAAATCAAGGAAACTTTGAGGCAGTGTAAGT 
TATCATGTGCCCTGCTGTTTCTGATGGCCCCCAAACTAGAAGTCATCAGTTTACTGGGAC 
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FIGURE 2CCC 

CCCAGCCTCCCGCTACCCCTGCATTTGTCCATTTTCTGTGCTGGATGGCTGGAAGCAGCC 

CACAGGTTTGGGGATCCATTCATGGCTAGCCCAGGCTTCTGTCCATGGAATAACATGTGG 

AGAGAGCTTCTTGACCAGTAAGATACCTTCTAGCAGCTGTCAAAGTACTTAAAAACCTCT 

ATGAATAGAATCAAAGCTTCAGTTCAGTTGCTGAATTTCCAAGAAGAAATTCAAATCAAA 

TTTAAAATGCCCACTCATTCATTCATTCAACAAAACTGTGAGTATCTGGTTTATGCCAGA 

GGCCATGCAAAGAGGTAACTAAGATGCAGAGAAGGACACTGCCTTCCAGGAGCTCACGGG 

GTGGAGGAGGAAAGAGGAAAGACAGACAGTGAACACACAACAGCAAGGTTACTGAGCTTG 

AACTATGTCCCTAACTACTAGATCTGAAATGACTACGCCAGATGCCAGATGCTCAAGTGC 

CAAGCTCTGGGTAACAGGAATAGACATCCTTCCAGGATGAGAGAGATGAGTCTGGATGAG 

GGTTAAGGCTGGAGGGACAGGCGGGATTTGAAGAGGAGGGAAAGGAAGTGGATGACACAT 

TCTGTTAACTGTCCAGCTGTGTCTCTACTGGTCACTCAGAGGCACGGGAGCCGCTCCCTT 

GGGCTGAGTCCATCAGAAGCCCCAGCCACCACCAGCTCTGGTTCATGTAGTAGAGCTTCC 

CACTCACACATCACAAATATGCCACCTCCCTTAGGACCCCTTCCTCTGCTCATTGACTCT 

TTTGTCTTCTTTCCTCTCGGGGGTGAGGTCAGATTTACCACCAAAATGCATGCAGGAGAT 

CCCGCAAGAGCAAATCAAGGAGATCAAGAAGGAGCAGCTTTCAGGATCCCCGTGGATTCT 

GCTAAGGGAAAATGAAGTCAGCACACTTTATAAAGGAGAATACCACAGAGCTCCAGTGGC 

CATAAAAGTATTCAAAAAACTCCAGGCTGGCAGCATTGCAATAGTGAGGCAGACTTTCAA 

TAAGGAGAT CAAAACCATGAAG AAATTCGAATCTCCCAAC AT C CTG CGTATATTTGGGAT 

TTGCATTGATGAAACAGTGACTCCGCCTCAATTCTCCATTGTCATGGAGTACTGTGAACT 

CGGGACCCTGAGGGAGCTGTTGGATAGGGAAAAAGACCTCACACTTGGCAAGCGCATGGT 

CCTAGTCCTGGGGGCAGCCCGAGGCCTATACCGGCTAGACCATTCAGAAGCACCTGAACT 

CCACGGAAAAATCAGAAGCTCAAACTTCCTGGTAACTCAAGGCTACCAAGTGAAGCTTGC 

AGGATTTGAGTTGAGGAAAACACAGACTTCCATGAGTTTGGGAACTACGAGAGAAAAGAC 

AGACAGAGTCAAATCTACAGCATATCTCTCACCTCAGGAACTGGAAGATGTATTTTATCA 

ATATGATGTAAAGTCTGAAATATACAGCTTTGGAATCGTCCTCTGGGAAATCGCCACTGG 

AGATATCCCGTTTCAAGGTGAAGAATGTGAAGACTGGCTCAGCCAGTGGCTGTAATTCTG 

AGAAGATCCGCAAGCTGGTGGCTGTGAAGCGGCAGCAGGAGCCACTGGGTGAAGACTGCC 

CTTCAGAGCTGCGGGAGATCATTGATGAGTGCCGGGCCCATGATCCCTCTGTGCGGCCCT 

CTGTGGATGAAATCTTAAAGAAACTCTCCACCTTTTCTAAGTAGTGTATCAAAATCTAAA 

CCAAGGAGTCTCTGGACAAGAAGCTGGGAGAGGCACGAACTGGACATCTCTCTCTCTCAT 

ATCCTTCGGCATTGGGTTATCTATGGGTGCAAGGAGTGGGCACGCTTCTCTGTTACAAAT 

AGAAAACGATTCCAGTCATACAGGACACATCCCACTCCAAATGATATTTCCAAAAACATA 

CCTCTGACAGTAACTTTGATAGATGGTTTGTCAAATGTATCTTTCTGGGTATCCACACCT 

CTTGGCAATGAAATTTGCAGCTCCTCCCTTCCATAAATGAAGTCTCTTTCCCCACCATTT 

GAATCTGGGCTGGCACTGTGACTTGATTTGATCAATAGAATGTGGAAGAAGTGACTGTAT 

GCCAGTTCCAAGCCTAGGTTTCAAGAGGCCTTATAAATGTCTGTTGGAACCTTACCCAGC 

CATGGACATGTTGAGTGAGCATGCTGGAGAATGAGAGACCACATGAAGCAGAAACATGCT 

TTCCTAGCTGAAGTCATACTAGCCCAACCAACATGGCAGCTAACACATGAATGAGGCCAA 

TCAAGACCAGAAGAACCACTCAAGCAGATCCCAGCCCAAATTGCCCATTCACACAATCAG 

GAGCTAAATAAATTACTGTTGTCTTTT 
SEO ID NO: 68 H85811_H 

CGCCCGGCCCCCTCCCCCGGCGCCGGCCACGGGAGGCGGTGATGCGGGCGCGGGCGGCCT 
CGGCTGCGCCGAGAGCGGAGACACAGGCTCAAGATGGCAGATTCCGACTGAGGCTGGGGG 
GGCCGAGCTCGCGCGCCGCTTTCCCGTCCCCGTTGCCATGAACCGCGGACACCCCGGCCC 
CGATGGCCCCCGTGTACGAAGGTATGGCCTCACATGTGCAAGTTTTCTCCCCTCACACCC 
TTCAATCAAGTGCCTTCTGTAGTGTGAAGAAACTGAAAATAGAGCCGAGTTCCAACTGGG 
ACATGACTGGGTACGGCTCCCACAGCAAAGTGTATAGCCAGAGCAAGAACATCCCCCTGT 
CGCAGCCAGCCACCACAACCGTCAGCACCTCCTTGCCGGTCCCAAACCCAAGCCTACCTT 
ACGAGCAGACCATCGTCTTCCCAGGAAGCACCGGGCACATCGTGGTCACCTCAGCAAGCA 
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FIGURE 2DDD 

GCACTTCTGTCACCGGGCAAGTCCTCGGCGGACCACACAACCTAATGCGTCGAAGCACTG 
TCAGCCTCCTTGATACCTACCAAAAATGTGGACTCAAGCGTAAGAGCGAGGAGATCGAGA 

ACACA^GCAGCGTGCAGATCATCGAGGAGCATCCACCCATGATTCA 
GGGCCACTGTCGCCACTGCCACCACGTCTACTGCCACCTCCAAAAACAGCGGCTCCAACA 

GCGAGGGCGACTATCAGCTGGTGCAGCATGAGGTACTGTGCTCCATGACCAACACCTACG 

agSc??agagScttgggccgag^ 

GCACCAATGAGATCGTAGCCATCAAGATCCTGAAGAACCACCCATCCTATGCCCGAC 

CTCAGATTGAAGTGAGCATCCTGGCCCGGTTGAGCACGGAGAGTGCCGATGACTATAACT 

TCGTCCGGGC^TACGAATGCTTCCAGCACAAGAACCACACGTGCTTGGTCTTCGAGATGT 

TGGAGCAGAACCTCTATGACTTTCTGAAGCAAAACAAGTTTAGCCCCTTGCCCCTCAAAT 

ACATTCGCC^AGTTCTCCAGCAGGTAGCCACAGCCCTGATGAAACTCAAAAGCCTAG 

TTATCCACGCTGACCTCAAACCAGAGAACATCATGCTGGTGGATCCATCTAGA^ 

IcAGAGTCAAGGTCATCGACTTTGGTTCAGCCAGCCACGTCTCCAAGGCTGTGTGCTCCA 

cSac??gcagtccagatattacagggcccctgagatcatccttggtttacc^ 

aggcaIttgacatgtggtccctgggctgtgttattgcagaat^ 
tatatccaggagattcggagtatgatcagattcggtatatttcacaaacacagggtttgc 

ctgctga^tatttattaagcgccgggacaaagacaactaggtttttcaaccgtgacacgg 

ac?cI?catatcctttgtggagactgaagacaccagatgaccatgaagcagagacaggga 

™S^ca^gLgcaagaaagtacattttcaactgtttagatgatatggccc^ 

I^gacgacagatttggaagggagcgacatgttggtagaaaaggctgaccggcgg^ 

tca^gacctgttgaagaagatgctgaccattgatgctgacaagag^ 
Jaaccctgaaccatccctttgtcaccatgacacacttactcgattttccccacagcacac 

cgg^cagagcaaaacccctttcatcacgcacgtggcccccagcacgtc^ 

tcacca^gacctttaacaaccagctgaccactgtccacaaccagccctca^ 

tcSSgcaStggcccagcggagcatgcccctgcagacaggaacagcccagatttgtgccc 

gSSgacccgttccagcaagctctcatcgtgtgtccccccggcttcca^^^ 

???S?cSc?IIgcacgctggctactcggtgcgaatggaaaatgcagttcccatcgtca 

ScSgcScLgagctcagcctcttcagatccaaccaggtctgcttgcccagcaggctt 

S^gSggacccagcagatcctgcttcccccagcatggcagcaactgactggaot 

ccacccacacctcagtgcagcatgccaccgtgattcccgagaccatggcaggcacccagc 

agctggcggactSLaa^^^ 

agcSgcaSattgaccggtcatgtgacgcttccagcagcacagccctt^^ 

tcgccScgtgatgcggcagcagccaaccagcaccacctcctcccggaagagtaa^^ 

IcSg^atctgtgagaaatgtctccacctgtgaggtgtcctcctctcaggccatcagct 

CCCCACAGCGA^^ 

gSgc?cggcctgcagcacctcggtcacctgtgggtggggcgacgtggcc 
?SgggScggcLcggcagacaattgtcattcccgacactcccagccccacggtca^ 
tca?cac^a?cagcagtgacacggacgaggaggaggaacagaaacacgccccc^ 
Sg?ctccaagcaaagaaaaaacgtcatcagctgtgtcacagtccacgactccccctact 

SgaSc^cagcaacaccag 

CCA^TGCCTTTGACACCAAGGGGAGCCTGGAGAATCACTGCACGGGGAACCCCCGAACCA 

tca^cgScLcccctgaaaacccaggccagcgaagtattggtggagtgtgatagcc^ 

ScSgtcScaccagtcaccactcgtcctcctacaagtccaagtcctcc 

cc5ccac?agcggtcactcttcagggagctcatctggagccatcacctaccggcag^ 

ggccgggcccccacttccagcagcagcagccactcaatctcagccaggctcagcagcaca 

?cacca?ggLcgcactgggagccaccgaaggcagcaggcctacatcactcccac^ 

cSaggctccgtactccttcccgcacaacagccccagccacggcactgtgcacccgcatc 

togctgSgccgctgccgctgcc 
cgccScggccctgggctccaccggca 
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FIGURE 2EEE 

CGCGCCACACCGTGCAGCACACTGCCTACCCAGCCAGCATCGTCCACCAGGTCCCCGTGA 
GCATGGGCCCCCGGGTCCTGCCCTCGCCCACCATCCACCCGAGTCAGTATCCAGCCCAAT 
TTGCCCACCAGACCTACATCAGCGCCTCGCCAGCCTCCACCGTCTACACTGGATACCCAC 
TGAGCCCCGCCAAGGTCAACCAGTACCCTTACATATAAACACTGGAGGGGAGGGAGGGAG 
GGAGGGAGGGAGAGAATGGCCCGAGGGAGGAGGGAGAGAAGGAGGGAGGCGCTCCTGGGA 
CCGTGGGCGCTGGCCTTTTATACTGAAGATGCCGCACACAAACAATGCAAACGGGGCAGG 
GGCGGGGGGGGGGGGGCAGAGGGCAGGGGGACGGGTCGGGACACCAGTGAAACTTGAACC 
GGGAAGTGGGAGGACGTAGAGCAGAGAAGAGAACATTTTTAAAAGGAAGGGATTAAAGAG 

GGTGGGAAATCTATGGTTTTTATTTTAAAAAAG 
SEO ID NO: 69 DYRK3_H 

CGGGAGCGAAAGTGCGCTGAGCTGCAGTGTCTGGTCGAGAGTACCCGTGGGAGCGTCGCG 

CCGCGGAGGCAGCCGTCCCGGCGTAGGTGGCGTGGCCGACCGGACCCCCAACTGGCGCCT 

CTCCCCGAGCGGGGTCCCGAGCTAGGAGATGGGAGGCACAGCTCGTGGGCCTGGGCGGAA 

GGATGCGGGGCCGCCTGGGGCCGGGCTCCCGCCCCAGCAGCGGAGTTGGGGGATGGTGTC 

TATGACACCTTCATGATGATAGATGAAACCAAATGTCCCCCCTGTTCAAATGTACTCTGC 

AATCCTTCTGAACCACCTCCACCCAGAAGACTAAATATGACCGCTGAGCAGTTTACAGGA 

GATCATACTCAGCACTTTTTGGATGGAGGTGAGATGAAGGTAGAACAGCTGTTTCAAGAA 

TTTGGCAACAGAAAATCCAATACTATTCAGTCAGATGGCATCAGTGACTCTGAAAAATGC 

TCTCCTACTGTTTCTCAGGGTAAAAGTTCAGATTGCTTGAATACAGTAAAATCCAACAGT 

TCATCCAAGGCACCCAAAGTGGTGCCTCTGACTCCAGAACAAGCCCTGAAGCAATATAAA 

CACCACCTCACTGCCTATGAGAAACTGGAAATAATTAATTATCCAGAAATTTACTTTGTA 

GGTCCAAATGCCAAGAAAAGACATGGAGTTATTGGTGGTCCCAATAATGGAGGGTATGAT 

GATGCAGATGGGGCCTATATTCATGTACCTCGAGACCATCTAGCTTATCGATATGAGGTG 

CTGAAAATTATTGGCAAGGGGAGTTTTGGGCAGGTGGCCAGGGTCTATGATCACAAACTT 

CGACAGT?CGTGGCCCTAAAAATGGTGCGCAATGAGAAGCGCTTTCATCGTCAAGCAGCT 

PRRGAGATCCGGATTTTGGAGCATCTTAAGAAACAGGATAAAACTGGTAGTATGAACGTT 

ATCCACATGCTGGAAAGTTTCACATTCCGGAACCATGTTTGCATGGCCTTTGAATTGCTG 

AGCATAGACCTTTATGAGCTGATTAAAAAAAATAAGTTTCAGGGTTTTAGCGTCCAGTTG 

GTACGCAAGTTTGCCCAGTCCATCTTGCAATCTTTGGATGCCCTCCACAAAAATAAGATT 

ATTCACTGCGATCTGAAGCCAGAAAACATTCTCCTGAAACACCACGGGCGCAGTTCAACC 

AAGGTCATTGACTTTGGGTCCAGCTGTTTCGAGTACCAGAAGCTCTACACATATATCCAG 

TCTCGGTTCTACAGAGCTCCAGAAATCATCTTAGGAAGCCGCTACAGCACACCAATTGAC 

ATATGGAGTTTTCGCTGCATCCTTGCAGAACTTTTAACAGGACAGCCTCTCTTCCCTGGA 

gagLtgaaggagaccagttggcctgcatgatggagcttctagggatgccaccaccaa^ 

CTTCTGGAGCAATCCAAACGTGCCAAGTACTTTATTAATTCCAAGGGCATACCCCGCTAC 
TGCTCTGTGACTACCCAGGCAGATGGGAGGGTTGTGCTTGTGGGGGGTCGCTC^ 

GGTTCTAAGCTGCCTCCAGTTGTTGGAATAGCCAATAAGCTTAAAGCTAACTTAATGTCA 
GAAACCAATGGTAGTATACCCCTATGCAGTGTATTGCCAAAACTGATTAGCTAGTGGACA 
GA^TATCCCCAGAGATGCATATGTGTATATTTTTATGATCTTACAAACCTGCAAATGGA 

Saaatgcaagcccattggtggatgtttttgttagagtagactttttttaaacaagaca^ 

AA^ATTTTTATATGATTATAAAAGAATTCTTCAAGGGCTAATTACCTAACCAGCTTGTAT 
TGGCCATCTGGAATATGCATTAAATGACTTTTTATAGGTCA 
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FIGURE 2FFF 
SEQ ID NO: 70_AA5 8 9241_M DYRK3JV1 

CCACGCGTCCGGAGTTGCTAGGAATGCCACCGCAGAAACTTCTGGAGCAATCCAAGCGTG 
CCAAGTACTTTATTAACTCCAAAGGCTTGCCTCGATACTGCTCCGTATCTACCCAGACGG 
ACGGGAGGGTGGTGCTTCTCGGGGGTCGCTCACGCAGGGGTAAAAAGCGAGGCCCGCCAG 
GCAGCAAAGACTGGGCAACCGCACTGAAGGGCTGTGGTGACTACTTGTTCATAGAGTTTC 
TGAAACGATGCCTCCAGTGGGACCCCTCTGCCCG'CCTCACCCCGGCTCAAGCATTAAGAC 
ATCCTTGGATTAGCAAGTCTACACCCAAACCTCTCACCATGGACAAGGTGCCAGGGAAGC 
GGGTAGTTAACCCTACAAATGCTTTCCAGGGACTGGGTTCCAAGCTGCCTCCAGTCGTTG 
GGATAGCCAGTAAGCTTAAAGCTT^ACCTAATGTCCGAAACCAGTGGTAGTATACCTCTGT 
GCAGTGTATTGCCAAAGCTGATTAGCTAGTGGACCACTCAGAGACTGATACATATCATAT 
GTATTTTTAATTACCTTGCAAACATGCAAATGGAAAACGGAATAATTGAAGCCCATTCAC 
TGATGGATATGTTTTTGTTAGACTTTTTTTTAACAAGGCAGAACATTTTTATATGACTAT 
AAAAGAACGCTTCAAGGGCTAATGTCAAACCAGCTTGTATTGGCCATCTGGAGTATACAT 
TAAATGACTTTTTCATAGGTC 

SEQ ID NO: 71_5R72_1 6__2_H 

GTCGAGGCGCAGCGCTGCCATGGCTGGGGGCCGTGGGGCCCCCGGGCGCGGCCGGGACGA 
GCCTCCGGAGAGCTACCCGCAACGACAGGACCACGAGCTACAGGCCCTGGAGGCCATCTA 
CGGCGCGGACTTCCAAGACCTGCGGCCGGACGCTTGCGGACCGGTCAAAGAGCCCCCTGA 
AATCAATTTAGTTTTGTACCCTCAAGGCCTAACTGGTGAAGAAGTATATGTAAAAGTGGA 
TTTGAGGGTTAAATGCCCACCTACCTATCCAGATGTAGTTCCTGAAATAGAGTTAAAAAA 
TGCCAAAGGTCTATCAAATGAAAGTGTCAATTTGTTAAAATCTCGCCTAGAAGAACTGGC 
CAAGAAACACTGTGGGGAGGTGATGATCTTTGAACTGGCTTACCACGTGCAGTCATTTCT 
CAGCGAGCATAACAAGCCCCCTCCCAAGTCTTTTCATGAAGAAATGCTGGAAAGGCGGGC 
TCAGGAGGAGCAGCAGAGGCTGTTGGAGGCCAAGCGGAAAGAuAGAGCAGGAGCAACGTGA 
AATCCTGCATGAGATTCAGAGAAGGAAAGAAGAGATAAAAGAAGAGAAAAAAAGGAAAGA 
AATGGCTAAGCAGGAACGTTTGGAAATTGCTAGTTTGTCAAACCAAGATCATACCTCTAA 
GAAGGACCCAGGAGGACACAGAACGGCTGCCATTCTACATGGAGGCTCTCCTGACTTTGT 
AGGAAATGGTAAACATCGGGCAAACTCCTCAGGAAGGTCTAGGCGAGAACGTCAGTATTC 
TGTATGTAATAGTGAAGATTCTCCTGGCTCTTGTGAAATTCTGTATTTCAATATGGGGAG 
TCCTGATCAGCTCATGGTGCACAAAGGGAAATGTATTGGCAGTGATGAACAACTTGGAAA 
ATTAGTCTACAATGCTTTGGAAACAGCCACTGGTGGCTTTGTCTTGTTGTATGAGTGGGT 
CCTTCAGTGGCAGAAAAAAATGGGTCCATTCCTTACCAGTCAAGAAAAAGAGAAGATTGA 
TAAGTGCAAAAAGCAGATTCAAGGAACAGAAACAGAATTCAACTCACTGGTAAAATTGAG 
CCATCCAAATGTAGTACGCTACCTTGCAATGAATCTCAAAGAGCAAGACGACTCCATCGT 
GGTGGACATTTTAGTGGAGCACATTAGTGGGGTCTCTCTTGCTGCACACCTGAGCCACTC 
AGGCCCCATCCCTGTGCATCAGCTTCGCAGGTACACAGCTCAGCTCCTGTCAGGCCTTGA 
TTATCTGCACAGCAATTCTGTGGTGCATAAGGTCCTGAGTGCATCTAATGTCTTGGTGGA 
TGCAGAAGGCACCGTCAAGATTACGGACTATAGCATTTCTAAGCGCCTCGCAGACATTTG 
CAAGGAGGATGTGTTTGAGCAAACCCGAGTTCGTTTTAGTGACAATGCTCTGCCTTATAA 
AACGGGGAAGAAAGGAGATGTTTGGCGTCTTGGCCTTCTGCTGCTGTCCCTCAGCCAAGG 
ACAGGAATGTGGAGAGTACCCTGTGACCATCCCTAGTGACTTACCAGCTGACTTTCAAGA 
TTTTCTAAAGAAATGTGTGTGCTTGGATGACAAGGAAAGATGGAGTCCCCAGCAGTTGTT 
GAAACACAGCTTTATAAATCCCCAGCCAAAT^ATGCCTCTAGTGGAACAAAGTCCTGAAGA 
TTCTGGAGGACAAGATTATGTTGAGACTGTTATTCCTAGCAACCGGCTACCCAGTGCTGC 
CTTCTTTAGTGAGACACAGAGACAGTTTTCCCGATACTTCATTGAGTTTGAAGAATTACA 
ACTTCTTGGTAAAGGAGCTTTTGGAGCTGTCATCAAGGTGCAGAACAAGTTGGACGGCTG 
CTGCTACGCAGTGAAGCGCATCCCCATCAACCCGGCCAGCCGGCAGTTCCGCAGGATCAA 
GGGCGAAGTGACACTGCTGTCACGGCTGCACCATGAGAACATTGTGCGCTACTACAACGC 
CTGGATCGAGCGGCACGAGCGGCCGGCGGGACCGGGGACGCCGCCCCCGGACTCCGGGCC 

U/li) 
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FIGURE 2GGG 

CCTGGCCAAGGATGACCGAGCTGCACGCGGGCAGCCGGCGAGCGACACAGACGGCCTGGA 

CAGCGTAGAGGCCGCCGCGCCGCCACCCATCCTCAGCAGCTCGGTGGAGTGGAGCACTTC 

GGGCGAGCGCTCGGCCAGTGCCCGTTTCCCCGCCACCGGCCCGGGCTCCAGCGATGACGA 

GGACGACGACGAGGACGAGCACGGTGGCGTCTTCTCCCAGTCCTTCCTGCCTGCTTCAGA 

TTCTGAAAGTGATATTATCTTTGACAATGAAGATGAGAACAGTAAAAGTCAGAATCAGGA 

TGAAGATTGCAATGAAAAGAATGGCTGCCATGAAAGTGAGCCATCAGTGACGACTGAGGC 

TGTGCACTACCTATACATCCAGATGGAGTACTGTGAGAAGAGCACTTTACGAGACACCAT 

TGACCAGGGACTGTATCGAGACACCGTCAGACTCTGGAGGCTTTTTCGAGAGATTCTGGA 

TGGATTAGCTTATATCCATGAGAAAGGAATGATTCACCGGGATTTGAAGCCTGTCAACAT 

TTTTTTGGATTCTGATGACCATGTGAAAATAGGTGATTTTGGTTTGGCGACAGACCATCT 

AGCCTTTTCTGCTGACAGCAAACAAGACGATCAGACAGGAGACTTGATTAAGTCAGACCC 

TTCAGGTCACTTAACTGGGATGGTTGGCACTGCTCTCTATGTAAGCCCAGAGGTCCAAGG 

AAGCACCAAATCTGCATACAACCAGAAAGTGGATCTCTTCAGCCTGGGAATTATCTTCTT 

TGAGATGTCCTATCACCCCATGGTCACGGCTTCAGAAAGGATCTTTGTTCTCAACCAACT 

CAGAGATCCCACTTCGCCTAAGTTTCCAGAAGACTTTGACGATGGAGAGCATGCAAAGCA 

GAAATCAGTCATCTCCTGGCTGTTGAACCACGATCCAGCAAAACGGCCCACAGCCACAGA 

GCTGCTCAAGAGTGAGCTGCTGCCCCCACCCCAGATGGAGGAGTCAGAGCTGCATGAAGT 

GCTGCACCACACGCTGACCAACGTGGATGGGAAGGCCTACCGCACCATGATGGCCCAGAT 

CTTCTCGCAGCGCATCTCCCCTGCCATCGATTACACCTATGACAGCGACATACTGAAGGG 

CAACTTCTCAATCCGTACAGCCAAGATGCAGCAGCATGTGTGTGAAACCATCATCCGCAT 

CTTTAAAAGACATGGAGCTGTTCAGTTGTGTACTCCACTACTGCTTCCCCGAAACAGACA 

AATATATGAGCACAACGAAGCTGCCCTATTCATGGACCACAGCGGGATGCTGGTGATGCT 

TCCTTTTGACCTGCGGATCCCTTTTGCAAGATATGTGGCAAGAAATAATATATTGAATTT 

AAAACGATACTGCATAGAACGTGTGTTCAGGCCGCGCAAGTTAGATCGATTTCATCCCAA 

AGAACTTCTGGAGTGTGCATTTGATATTGTCACTTCTACCACCAACAGCTTTCTGCCCAC 

TGCTGAAATTATCTACACTATCTATGAAATCATCCAAGAGTTTCCAGCACTTCAGGAAAG 

AAATTACAGTATTTATTTGAACCATACCATGTTATTGAAAGCAATACTCTTACACTGTGG 

GATCCCAGAAGATAAACTCAGTCAAGTCTACATTATTCTGTATGATGCTGTGACAGAGAA 

GCTGACGAGGAGAGAAGTGGAAGCTAAATTTTGTAATCTGTCTTTGTCTTCTAATAGTCT 

GTGTCGACTCTACAAGTTTATTGAACAGAAGGGAGATTTGCAAGATCTTATGCCAACAAT 

AAATTCATTAATAAAACAGAAAACAGGTATTGCACAGTTGGTGAAGTATGGCTTAAAAGA 

CCTAGAGGAGGTTGTTGGACTGTTGAAGAAACTCGGCATCAAGTTACAGGTCTTGATCAA 

TTTGGGCTTGGTTTACAAGGTGCAGCAGCACAATGGAATCATCTTCCAGTTTGTGGCTTT 

CATCAAACGAAGGCAAAGGGCTGTACCTGAAATCCTCGCAGCTGGAGGCAGATATGACCT 

GCTGATTCCCCAGTTTAGAGGGCCACAAGCTCTGGGGCCAGTTCCCACTGCCATTGGGGT 

CAGCATAGCTATAGACAAGATATCTGCTGCTGTCCTCAACATGGAGGAATCTGTTACAAT 

AAGCTCTTGTGACCTCCTGGTTGTAAGTGTTGGTCAGATGTCTATGTCCAGGGCCATCAA 

CCTAACCCAGAAACTCTGGACAGCAGGCATCACAGCAGAAATCATGTACGACTGGTCACA 

GTCCCAAGAGGAATTACAAGAGTACTGCAGACATCATGAAATCACCTATGTGGCCCTTGT 

CTCGGATAAAGAAGGAAGCCATGTCAAGGTTAAGTCTTTCGAGAAGGAAAGGCAGACAGA 

GAAGCGTGTGCTGGAGACTGAACTTGTGGACCATGTACTGCAGAAACTGAGGACTAAAGT 

CACTGATGAAAGGAATGGCAGAGAAGCTTCCGATAATCTTGCAGTGCAAAATCTGAAGGG 

GTCATTTTCTAATGCTTCAGGTTTGTTTGAAATCCATGGAGCAACAGTGGTTCCCATTGT 

GAGTGTGCTAGCCCCGGAGAAGCTGTCAGCCAGCACTAGGAGGCGCTATGAAACTCAGGT 

ACAAACTCGACTTCAGACCTCCCTTGCCAACTTACATCAGAAAAGCAGTGAAATTGAAAT 

TCTGGCTGTGGATCTACCCAAAGAAACAATATTACAGTTTTTATCATTAGAGTGGGATGC 

TGATGAACAGGCATTTAACACAACTGTGAAGCAGCTGCTGTCACGCCTGCCAAAGCAAAG 

ATACCTCAAATTAGTCTGTGATGAAATTTATAACATCAAAGTAGAAAAAAAGGTGTCTGT 

GCTATTTCTGTACAGCTATAGAGATGACTACTACAGAATCTTATTTTAACCCTAAAGAAC 

TGTCGTTAACCTCATTCAAACAGACAGAGGCTTATACTGGAATAATGGAATGTTGTACAT 
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FIGURE 2HHH 

TCATCATAATTTAAAATTAAATTCTAAGAAGAGGCTGGGTGCAGTGGCTCACACCTTTAA 
TCCCAGCACTTTGGGAAGCCAAGGCAGGAAGACTGCTTGAAACCAGGAGTTTGAGACCAG 

CCT 

SEQ ID NO: 73_R43524_H, HRI_H 

ATGCTGGGGGGCAACTCCGGGGTCCGCAAGCGCGAAGAGGAGGGCGACGGGGCTGGGGCT 

GTGGCTGCGCCGCCGGCCATCGACTTTCCCGCCGAGGGCCCGGACCCCGAATATGACGAA 

TCTGATGTTCCAGCAGAAATCCAGGTGTTAAAAGAACCCCTACAACAGCCAACCTTCCCT 

TTTGCAGTTGCAAACCAACTCTTGCTGGTTTCTTTGCTGGAGCACTTGAGCCACGTGCAT 

GAACCAAACCCACTTCGTTCAAGACAGGTGTTTAAGCTACTTTGCCAGACGTTTATCAAA 

ATGGGGCTGTTGTCTTCTTTCACTTGTAGTGACGAGTTTAGCTCATTGAGACTACATCAC 

AACAGAGCTATTACACACTTAATGAGGTCTGCTAAAGAGAGAGTTCGTCAGGATCCTTGT 

GAGGATATTTCTCGTATCCAGAAAATCAGATCAAGGGAAGTAGCCTTGGAAGCACAAACT 

TCACGTTACTTAAATGAATTTGAAGAACTTGTCATCTTAGGAAAAGGTGGATACGGAAGA 

GTATACAAGGTCAGGAATAAATTAGATGGTCAGTATTATGCAATAAAAAAAATCCTGATT 

AAGGGTGCAACTAAAACAGTTTGCATGAAGGTCCTACGGGAAGTGAAGGTGCTGGCAGGT 

CTTCAGCACCCCAATATTGTTGGCTATCACACCGCGTGGATAGAACATGTTCATGTGATT 

CAGCCACGAGCAGACAGAGCTGCCATTGAGTTGCCATCTCTGGAAGTGCTCTCCGACCAG 

GAAGAGGACAGAGAGCAATGTGGTGTTAAAAATGATGAAAGTAGCAGCTCATCCATTATC 

TTTGCTGAGCCCACCCCAGAAAAAGAAAAACGCTTTGGAGAATCTGACACTGAAAATCAG 

AATAACAAGTCGGTGAAGTACACCACCAATTTAGTCATAAGAGAATCTGGTGAACTTGAG 

TCGACCCTGGAGCTCCAGGAAAATGGCTTGGCTGGTTTGTCTGCCAGTTCAATTGTGGAA 

CAGCAGCTGCCACTCAGGCGTAATTCCCACCTAGAGGAGAGTTTCACATCCACCGAAGAA 

TCTTCCGAAGAAAATGTCAACTTTTTGGGTCAGACAGAGGCACAGTACCACCTGATGCTG 

CACATCCAGATGCAGCTGTGTGAGCTCTCGCTGTGGGATTGGATAGTCGAGAGAAACAAG 

CGGGGCCGGGAGTATGTGGACGAGTCTGCCTGTCCTTATGTTATGGCCAATGTTGCAACA 

AAAATTTTTCAAGAATTGGTAGAAGGTGTGTTTTACATACATAACATGGGAATTGTGCAC 

CGAGATCTGAAGCCAAGAAATATTTTTCTTCATGGCCCTGATCAGCAAGTAAAAATAGGA 

GACTTTGGTCTGGCCTGCACAGACATCCTACAGAAGAACACAGACTGGACCAACAGAAAC 

GGGAAGAGAACACCAACACATACGTCCAGAGTGGGTACTTGTCTGTACGCTTCACCCGAA 

CAGTTGGAAGGATCTGAGTATGATGCCAAGTCAGATATGTACAGCTTGGGTGTGGTCCTG 

CTAGAGCTCTTTCAGCCGTTTGGAACAGAAATGGAGCGAGCAGAAGTTCTAACAGGTTTA 

AGAACTGGTCAGTTGCCGGAATCCCTCCGTAAAAGGTGTCCAGTGCAAGCCAAGTATATC 

CAGCACTTAACGAGAAGGAACTCATCGCAGAGACCATCTGCCATTCAGCTGCTGCAGAGT 

GAACTTTTCCAAAATTCTGGAAATGTTAACCTCACCCTACAGATGAAGATAATAGAGCAA 

GAAAAAGAAATTGCAGAACTAAAGAAGCAGCTAAACCTCCTTTCTCAAGACAAAGGGGTG 

AGGGATGACGGAAAGGATGGGGGCGTGGGATGA 
SEQ ID NO: 74 17000057519457_H 

CACAAGAGCCCTTCCTGCAGGGAACCTCAGGCTTCAGAGAGCCGAAAAGTTGGGAGGCGT 
AACCACTTACAGGCCGGAAGTGTCCGGGGTGGACGCATTCGGGTAGCCGAAGAAGTCCCA 
GGATTGCCGAAGAAGTCCCAGGATTTCCGAAGCGAGCCGAAGCATCGCGACAGTTTTCAG 
AGACAGCTGATCGGTTGGAGCTGTTGCGCCGAGCAGTCATGGCGGCGGCCAGAGCTACTA 
CGCCGGCCGATGGCGAGGAGCCCGCCCCGGAGGCTGAGGCTCTGGCCGCAGCCCGGGAGC 
GGAGCAGCCGCTTCTTGAGCGGCCTGGAGCTGGTGAAGCAGGGTGCCGAGGCGCGCGTGT 
TCCGTGGCCGCTTCCAGGGCCGCGCGGCGGTGATCAAGCACCGCTTCCCCAAGGGCTACC 
GGCACCCGGCGCTGGAGGCGCGGCTTGGCAGACGGCGGACGGTGCAGGAGGCCCGGGCGC 
TCCTCCGCTGTCGCCGCGCTGGAATATCTGCCCCAGTTGTCTTTTTTGTGGACTATGCTT 
CCAACTGCTTATATATGGAAGAAATTGAAGGCTCAGTGACTGTTCGAGATTATATTCAGT 
CCACTATGGAGACTGAAAAAACTCCCCAGGGTCTCTCCAACTTAGCCAAGACAATTGGGC 
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AGGTTTTGGCTCGAATGCACGATGAAGACCTCATTCATGGTGATCTCACCACCTCCAACA 

TGCTCCTGAAACCCCCCCTGGAACAGCTGAACATTGTGCTCATAGACTTTGGGCTGAGTT 

TCATTTCAGCACTTCCAGAGGATAAGGGAGTAGACCTCTATGTCCTGGAGAAGGCCTTCC 

TCAGTACCCATCCCAACACTGAAACTGTGTTTGAAGCCTTTCTGAAGAGCTACTCCACCT 

CCTCCAAAAAGGCCAGGCCAGTGCTAAAAAAATTAGATGAAGTGCGCCTGAGAGGAAGAA 

AGAGGTCCATGGTTGGGTAGAAGAATGTGTATGACAACCACACACAGTGAAGCTCTTTTT 

TCAAAGTAAATTTGAAGAAATGCTACAAGTATGAGATGAGATCTAAGTAAAGGTGTTAAG 

ATATTTTTAAGTGGTATGTGATCGTGTCATTATCATCTGCACTTCACTCAAGAGCTTACT 

ATGTGTCTAAGTCATGTTCTAGGCAGAATTGGGTATTTAAAGTAAATTGAGGACAGGCTT 

CTCCCAGATTGTGACATGTATATCTCAGATACATGGGTGTGGCATTGAACCACATAATGA 

GAACATTATTCTCTTTTTAGTCCTTGTGAGACAAGGATGAAGTCTCAGTTGCTGATACTC 

GCTGAGCTTACTGGCCCTCTAACCCAGTGTTTTTTTTTGTTGTTGTTGTGTACATGTTAT 

ATTTATTTTGAAACCAGTTTAATGGGATACAACCAGCATTTTAAAAAATGAAATAGAATA 

CAGCATGGAAAATATCAGTGTATTGTTTTATGAAACTTTCACGTGTATATATAGACCAAG 

GATATGTGCTGAGTTTTGATGTCAAATATATTTCTCTTTCAGGGTCATGATCAAAAAATG 

AAAAGTCTGCTTAACTCCAATTTCTCTTTTAAAAAAGCAGACTTACAGCTTTCAGGCAAC 

TGAAATTCATGTTAACATGTTTTTATTTTTATTGCTTTGTATTTTTGTGGTTACCTTCTA 

AGACAAGTGATTGATCTAAAGTTCCTTTTAAGTTTATACCGCTAAACAAACTGAGTTGAT 

TTCTATCACAGGCAGTAAGTAGGTAGAGCAAAAATGGTGAAGTGACTTGTGAAGACTGAA 

GTTTGATGAAGTCTGGTTTAAGGCACAGGTAAACTGAGTGTGGATGCAAAAGTACCAGGA 

GCTAGCTTTTAACCTTGCCCAGCCTCAGTTTCTTTTCTTAGAAGAAGCTATGTTTGGGTG 

GGAAGGGAAGAGAGGGATAAGAAAATACCTTTCTTCCTTGTAAACTCCAATCAACAAACA 

TATTTTGAGTGCCTTTTGTGTTCCTTGGCACCCTGTTGGGTATTGGGTACTTGGCACCCT 

GTTGGGTATTGGGTACAATGGTGAGCCAGACAGACACAGCGCCTGTCCTTTTGTAAGAAT 

ATTTATTTTTATAAAAAAGTATAAAGTATACAGTGGGATGTTTTGATATACATTATGAAA 

TGATTGCTACAGCTGAGCTAATTAACACCCATCACCTCACATAGTTACTGTCTTGTTTCT 

TAATATGGACATTTGCAGCTATGAATTTCCCTCTGCACACTGTTGTCATCACACACTCTC 

AGTTTTGGTATTTTGTGTTTTTGTTTTCATTCATCTCAAAGTATTTTCTAATTTCCCTTG 

TGATTTCTTCTTTGACCCCTTGATTGTTTAGAAATCTGTTAATTTCCACACATTTGTAAA 

TGTTCCAATTTTTCTTTTGTTATTGCCAGCTTCATTCCATTGTGTTCAGAGATGATACAG 

TCAGTGCCTGTTCTTATGAAGCAAACATTCTATAATAGTAGGACCAGTACCCTGTCTGTT 

TCATTCACCACAGTCAGCATGCCCCAAGTGCCCAGCATGGGGCGGATGGCCAGGAATGAG 

TGAAAACTTCCCTTCCTGGGTAGTTGTGACTAGTAGAGAGGAAAAATAATATAATTGCCT 

GCTTACTGCATGCCAGGCATTGGGCTGGGAATTTTTATATTGGATCTAAAATAACTCTTA 

AGTTAGGCATTATCCCCATTTTATAGATGGAGAAACTGGCCCCAAAAGGTGGGAACTTGT 

CCAAGACGTCACAGGTAGCAAGAGGTACTTTTACCTGGCTCCAAATCTGTGTTCTTTCCA 

CTGACAAATGAGATATGGGATATGGTGCATCTTTACAGTACTATAATAAGTATTGGCGTA 

TAACATTATTTTCAAGGAACTCCAAGGGCCACAGGAGCTGACAGGTTTTTCAATTAATAT 

TCCCAACATGAATGAGATGCCTCATTCCTCAGTTTCCTCACGTGTACTATAAGGCTAGTA 

CCTGCTTTGTTGGGGTATGGTTGGCTCGTGTGCATTAAGTCAACAAATCCCTAGT 

SEQ ID NO: 75_AA013524_M 

CTGGTGCAGCAGGGCGCCGAGGCGCGCGTTTTCCGTGGCCGCTTCCAGGGCCGCGCGGCC 
GTGGTGAAGCACCGCTTCCCGAAGAGTTACCGGCACCCGGAGCTGGAGGCGCGGCTCGGC 
CGTCGGCGGACGGTGCAGGAGGCGCGCGCGCTGCTCCGCTGCCGCCGTGCGGGGATAGCT 
GCCCCAGTCGTCTTCTTTGTGGACTATGCGTCTAACTGCTTATATATGGAAGAAATCGAA 
GACTCGGTGACTGTTCGGGATTATATCCAATCCACTATGGAGACTGAAAAGGACCCCCAG 
TGCCTCTTGGACCTGGCCAGGAGGATGGGGCAGGTTCTGGCCGGAATGCACGACCAAGAC 
CTCATTCACGGGGACCTCACCACCTCCAACATGCTCCTGAGGCGGCCCCTGGCGCAGCTG 
CACATCGTGCTCATCGACTTTGGGCTGAGCTTTGTCTCAGGACTGCCGGAAGATAAAGGC 
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GTCGACCTCTATGTCCTGGAGAAGGCCTTCCTCAGCACGCACCCCCACACCGAGACCGCG 
TTTGAAGCCTTTCTGAAGAGTTACGGGGCCTCGTCCAAGAAGTCCAGTCCAGTGCTGAAG 
AAGTTAGATGAGGTGCGCCTGAGAGGGCGAAAGCGGTCCATGGTCGGGTAGTGGAGCTGT 
GGTGAACTGGCTCACGGTGAAGGATGATGTAGACGAGGCTGGACCCCTCAGCAAAGCATG 
GGTTGTTAAGTGGTCTGTGATCGTGCTGGGCCACCACCATCCATGGCTCACTGTTCTCAG 
GGGCTTCATGTACATGAGGTTTATTCTGGGCAGAACTGGGTAGGTAGCCCAGGCTAGCCT 
TGAATTTATGGCAACATCCTACCTCAGCTTGCTTGGAAGAGGTTATAAGCCACCATACCT 

GACTTTGCACTGATTCTGTCAGAAAC 

SEO ID NO- 76 1700O139801197_H, IRAKM_H 

ATGGCGGGGAACTGTGGGGCCCGCGGCGCGCTGTCGGCGCACACGCTGCTGTTCGACCTG 

CCGCCCGCGCTGCTCGGAGAGCTCTGCGCTGTTCTGGACAGCTGCGACGGCGCGCTGGGC 

TGGCGCGGCCTGGCAGAGAGACTTTCAAGCAGCTGGCTGGATGTTCGTCATATTGAAAAG 

TATGTAGACCAAGGTAAAAGTGGAACAAGAGAATTACTTTGGTCCTGGGCACAGAAAAAC 

AAGACCATCGGTGACCTTTTACAGGTCCTCCAGGAGATGGGACATCGTCGAGCTATTCAT 

TTAATTACAAACTATGGAGCAGTGTTGAGTCCTTCAGAGAAGAGTTATCAGGAAGGTGGA 

TTTCCAAATATATTATTCAAGGAAACAGCCAATGTCACCGTGGATAATGTTCTTATTCCT 

GAACATAATGAAAAAGGAGTACTGCTTAAATCTTCCATCAGCTTTCAAAATATCATAGAA 

GGAACTAGAAATTTCCACAAAGACTTCCTAATTGGAGAAGGAGAGATTTTTGAGGTATAC 

AGAGTGGAGATTCAAAACCTAACATATGCTGTCAAATTATTTAAACAGGAGAAAAAAATG 

CAGTGTAAGAAGCATTGGAAGAGGTTTTTATCTGAGCTTGAAGTTTTACTACTGTTTCAT 

CACCCAAACATACTAGAGTTGGCTGCATATTTTACAGAGACTGAGAAGTTCTGTCTGATT 

TATCCATACATGAGAAATGGAACACTTTTTGACAGATTGCAGTGTGTAGGTGACACGGCC 

CCACTCCCTTGGCACATTCGAATCGGTATATTAATAGGAATATCCAAAGCCATTCACTAC 

CTGCACAACGTTCAACCATGCTCGGTCATCTGTGGCAGTATATCAAGTGCAAACATCCTT 

TTGGATGATCAGTTTCAACCCAAACTAACTGATTTTGCCATGGCACACTTCCGGTCCCAC 

CTAGAACATCAGAGTTGTACCATAAATATGACCAGCAGCAGCAGTAAACATCTGTGGTAC 

ATGCCAGAAGAGTACATCAGACAGGGGAAACTTTCCATTAAAACAGATGTCTACAGCTTT 

GGAATTGTAATAATGGAAGTTCTAACAGGATGTAGAGTAGTGTTAGATGATCCAAAACAT 

ATCCAGCTGCGGGATCTCCTTAGAGAATTGATGGAGAAGAGAGGCCTGGATTCATGTCTC 

TCATTTCTAGATAAGAAAGTGCCTCCCTGCCCTCGGAATTTCTCTGCCAAGCTCTTCTGT 

TTGGCAGGCCGGTGTGCTGCAACGCGGGCAAAGTTAAGACCATCAATGGATGAAGTTTTA 

AATACTCTTGAAAGTACTCAAGCCAGCTTGTATTTTGCTGAAGATCCTCCCACATCACTA 

AAGTCCTTCAGGTGTCCTTCTCCTCTATTCCTGGAGAATGTACCAAGTATTCCAGTGGAA 

GATGATGAAAGCCAGAATAACAATTTACTACCTTCTGATGAAGGCCTGAGGATAGACAGA 

ATGACTCAGAAAACTCCTTTTGAATGCAGCCAGTCTGAGGTTATGTTTCTGAGCTTGGAC 

AAAAAGCCAGAGAGCAAGAGAAATGAGGAAGCTTGCAACATGCCCAGTTCTTCTTGTGAA 

GAAAGTTGGTTCCCAAAGTATATAGTTCCATCCCAGGACTTAAGGCCCTATAAGGTAAAT 

ATAGATCCTTCTTCAGAAGCTCCAGGGCATTCTTGCAGGAGCAGGCCAGTGGAGAGCAGC 

TGTTCCTCCAAATTTTCCTGGGATGAATATGAACAGTACAAAAAAGAATAA 



SEO ID NO: 77 AA840598 M IRAKM_M 

ATGTGGAAGAGATTTTTATCAGAACTGGAAGTTCTACTCCTGTTCCGTCACCCCCACATA 
CTAGAGCTGGCTGCATATTTCACGGAGACTGAGAAACTTTGTCTGGTTTATCCCTATATG 
AGCAACGGGACGCTTTTCGACAGATTACAGTGCACAAATGGCACAACCCCGCTTTCCTGG 
CACCTTCG^TCAGCGTATTGATAGGAATAGCCAAAGCCATCCAATACTTGCACAACACT 
CAGCCGTGCGCCGTCATCTGTGGCAACGTTTCCAGTGCAAACATACTCTTGGATGACCAG 
CTCCAACCCAAACTAACGGATTTTGCTGCAGCGCACTTCCGACCCAATCTAGAGCAGCAG 
AGTTCTACCATAAATATGACCGGCGGTGGCAGGAAACATCTGTGGTACATGCCAGAAGAA 
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GACCTCCTCATGGAACTGATG^GAA^^^^^^P^^ ^.pQQ^QQQQCGG 

AGGAAGATACCACCCTGTCC^^ 
TGTGTGGCAACGAAGGCCAAGTTAAGACCCACGATGG 

AGCACCCAGCCTAGCTTGTATTTTGCAGAAGACCC^ 
TGTCCTTCTCCACTGT^ 

CAGAATAACCATTCAGTACCTCCCAAGGAAGTT1 GA CCGAAACAGAGGG 

ACCCCCTTTGAATGCAGCCAGTCTGAGGTC^ACCTTTCTA 

AACAGGGGAAGTGAAGCGGATTGCAACGTGCCCAGTTCT^ 

CCAGAGCTTGTGGCGCCATCCCA^GAC^ 
TGGGAAGTACCAGGCCATTCTTAT^^ 

CTCTTTTGCAGTGAGCATGAACAGTO 
AAAATAAAAGCAAACGTCACTGAAGGC^ACTG^AGCAA^TA 

AGCTCTGAGCTCCGTGAGTACAGCCAAGGGA^ 

ATTAGCAGCAAGGAAGTCTATTCC^^ 

TCAAGTGACCGCCTCTCAGTCAAACCTGAGAAGCTAAA AGGACAAGTCTC TCTCTT 

AAGATTCCGGGTTCTGACAACCAAAACCTAGC^ 

AAGTCTCTCACTCTCTCTCATCATCCGAG^ 

CCCACCTTCCAGAACCAGAACCACCTTCTCCCCAA^^ r ArTCTCC CTCTGCGCATGCC 

gcagccLtagtcaccagcagccaatcatg^ 
ctacgtcctttataa^cccagg™ 

cagaggcagccttttgtatac^ttccctgacc^u TCATTT gcacaagaaca 

Stgttgcgtagtgtgactttgtggc™ 

agcgaatacacaacaacaaagcccacc^ 

tgctagggatactgacagtctatttgcttcccatggi ggaaggaattgtgt 
ttatattaaagaattccagcact 

TTAATGAAACTGCCATTCACCATCCCTGAGCTGGT1L. TGGA CCCTGAGTCA 

gLggggtcttctacacaggccggaagcagg^ 
ggggagacccagatgacactgaccacagag^ 
cgaacacagtatacggtcaccatgcat^ 

ACCTACCGCCGCTACTCAGCGCC^ 

ctggcgtcctgcgggatgggcctgctgctcaci^ GGCACCAGGA cggcctg 

cgccagctgccgcatctcacgctggctc^ 
ggccacatccgactgcctgcctcaggccccc^ 

GACACCCAGCTGCTAATGACGCTGTATGT^ 
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AGCACTGCTGTTAGATACCCCTCAGGCAGTGTCGC^TCCC^GCCAGT^^GCTCATT 



CK3GAGTGGAACTGCAGAGACAAGACi-l^»u „ CGAGCTGC atCCAGAAGAAAAA 

Sgctattgagcctgagccgagagamctttgggactc^ 

CCG^CCTGGATCGCGGGGGTCTGGAGCCCGAGGTCGTGC^ 

aIg^gctgcctgctggccgctgtagotcagcctcca^ 

^A^ACaCCTGCTCCGTGCTCTGA^^^^^ncACAAAC 
GGGAGGTGA 

SEQ ID NO: 79_HGP_6644466 TG TGTTGG TGCTAGAGGCAGCTGC 

t™ccgacctttccttgcagggggtgag^tctggac 

GGAAGGGATCAGTAATTTCAAGACA^ 

ATGTTCAACTCCAACTATAAATATCCCGGCCTCTC^ GGTTTGTCTC ATTCTCCTTG 
TACTGGGGTAAATGTGTACCTAATGAAAMATCTCCAAG ^tctotatcAAAAGAG 

ggctgtaaaaaagattaatcctatat^ 

^TGGATCMGCTAACmTmGAA^CCTTCATCA ATGGAGGT GAAAA 

tccttttactgaagccaatgatggca^ctgtgtcttg ccttttcca gcagc 

GTCTCTAMTGACTTAATAGAAGAACGATATAAAGCL TCTGC ACCAAGAAAA 

catIa^ttaaaasttgctttgaa^ 
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TAATGAAGACCCTAAAGATCGTCCTTCTGCTGCACACATTGTTGAAGCTCTGGAAACAGA 

^Sactgat^^ 

TTTACATAGTTACTATCAGTAGTTATTAGACTCTAAAATTGGCATATTTGAGGACCATAG 
TTTCTTGTTAACATATGGATAACTATTTCTAATATGA7^ATATGCTTATATTGGCTATAAG 
CACTTGGAATTGTACTGGGTTTTCTGTAAAGTTTTAGAAACTAGCTACATAAGTACTTTG 
ATACTGCTCATGCTGACTTAAAACACTAGCAGTAAAACGCTGTAAACTGTAACATTAAAT 
TGAATGACCATTACTTTTATTAATGATCTTTCTTAAATATTCTATATTTTAATGGATCTA 
CTGACATTAGCACTTTGTACAGTACAAAATAAAGTCTACATTTGTTTAAAACACTGAACC 
TTTTGCTGATGTGTTTATCAAATGATAACTGGAAGCTGAGGAGAATATGCCTCAAAAAGA 
GTAGCTCCTTGGATACTTCAGACTCTGGTTACAGATTGTCTTGATCTCTTGGATCTCCTC 
AGATCTTTGGTTTTTGCTTTAATTTATTAAATGTATTTTCCATACTGAGTTTAAAATTTA 
TTAATTTGTACCTTAAGCATTTCCCAGCTGTGTAAAAACAATAAAACTCAAATAGGATGA 
TAAAGAATAAAGGACACTTTGGGTACCAGAAGGTGTCTCAGCATTATTTTATACTTC 

9FO ID NO: 80 AA44 9542 M • 
ATCTCCAAGAGGGTTGTCTCATTCTCCTTGGGCCGTGAAAAAGATAAGTCTTTTATGCGA 

TGATCATTATCGAACTGTGTATCAGAAGAGACTAACTGATGAAGCTAAGATTTTAAAAAA 

CCTTAATCACCCAAACATTATAGGATATCGTGCTTTTACTGAAGCCAGTGATGGTAGTCT 

GTGCCTTGCTATGGAGTATGGAGGTGAAAAGTCTCTGAATGACTTAATAGAAGAGCGGAA 

CAAAGACAGTGGAAGTCCTTTTCCAGCAGCTGTAATTCTCAGAGTTGCTTTGCACATGGC 

CAGAGGGCTAAAGTACCTGCACCAAGAAAAGAAGCTGCTTCATGGAGACATAAAGTCTTC 

AAATGTTGTAATTAAAGGTGATTTTGAAACAATTAAAATCTGTGATGTAGGAGTCTCTCT 

GCCATTGGATGAAAATATGACTGTGACTGATCCTGAGGCCTGTTATATTGGTACTGAGCC 

ATGGAAACCCAAGGAAGCGTTGGAAGAAAATGGCATCATTACTGACAAGGCAGATGTGTT 

TGCTTTTGGCCTTACTCTGTGGGAAATGATGACTTTATGTATTCCACACGTCAATCTTCC 

Igatgatgatgttgatgaagatgcaacctttgatgagagtgacttcgatgatgaagcata 

TTATGCAGCTCTGGGGACAAGGCCATCCATCAACATGGAAGAGCTGGATGACTCCTACCA 
GAAGGCCATTGAACTCTTCTGTGTGTGCACTAATGAGGATCCTAAAGATCGCCCGTCTGC 
TGCACACATCGTTGAAGCTTTGGAACTAGATGGCCAATGTTGTGGTCTAAGCTCAAAGCA 
TTAACTTGTATGGGAACTGTTAACTAGATATATGTAGTTAATATAACTTATGGTAGCTAG 
ATTCTAGAAGTAGCTTTAACACTAGTGACCCCTGTCTAAGATGACTTAAGAATCAAGGGA 
CCATTGCTTTGTTACAGATCTTTTTAGATATTCTTGCTTCTTTAGTGGGTTACTAAAAAT 
TTCACTACGTACATGTGGTACAGATATCTGTCTGCTCATAGTGTCAGTCCTTCAGCTGGC 
CTGTCAGCCCATGCGCCCTGGGACTTGAGAAGAGTTCATAAACGTAGCTCCTAGGGTGTC 

SgcSctctac^cttagcttctaatttattactttgtttctactgattgtgtcttaag^ 

CTTTTAAAATAAATGTAAGAATAAACAATAAAAGACAGTTTTAGTACCAGG 
c;PO ID NO- 81 5R57 10 2 M TESK2_M 

GCTGCTGGACAGTGACTTGTATTTACCGTGGACTGTGAGAGTGAAACTGGCCTATGGCAT 
AGCAGTGGGCCTCAGCTACCTTCACTTCAAAGGCATTTTCCATCGGGACCTCACATCAAA 

GGTGTGAAGGCTTTGCTTTC 
qEO ID NO: 82 AA232253 H 

ATGTCGTCTCTCG^TGCCTCCT^TGTGCAAATTAAATTTGATGACTTGCAGTTTTTTGAA 
AACTGCGGTGGAGGAAGTTTTGGGAGTGTTTATCGAGCCAAATGGATATCACAGGACAAG 
^rrTGGCTGTAAAGAAGCTCCTCAAAATAGAGAAAGAGGCAGAAATACTCAGTGTCCTC 

GTCACAGAATATGCTTCTCTGGGATCACTCTATGATTACATTAACAGTAACAGAAGTGAG 

gaga^^ 

TTACATATGGAGGCTCCTGTCAAGGTGATTCACAGAGACCTCAAGTCAAGAAACGTTGTT 
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ATAGCTGCTGATGGAGTATTGAAGATCTGTGACTTTGGTGCCTCTCGGTTCCATAACCAT 

ACAACACACATGTCCTTGGTTGGAACTTTCCCATGGATGGCTCCAGAAGTTATCCAGAGT 

CTCCCTGTGTCAGAAACTTGTGACACATATTCCTATGGTGTGGTTCTCTGGGAGATGCTA 

ACAAGGGAGGTCCCCTTTAAAGGTTTGGAAGGATTACAAGTAGCTTGGCTTGTAGTGGAA 

AAAAACGAGAGATTAACCATTCCAAGCAGTTGCCCCAGAAGTTTTGCTGAACTGTTACAT 

CAGTGTTGGGAAGCTGATGCCAAGAAACGGCCATCATTCAAGCAAATCATTTCAATCCTG 

GAGTCCATGTCAAATGACACGAGCCTTCCTGACAAGTGTAACTCATTCCTACACAACAAG 

GCGGAGTGGAGGTGCGAAATTGAGGCAACTCTTGAGAGGCTAAAGAAACTAGAGCGTGAT 

CTCAGCTTTAAGGAGCAGGAGCTTAAAGAACGAGAAAGACGTTTAAAGATGTGGGAGCAA 

AAGCTGACAGAGCAGTCCAACACCCCGCTGCTGCCTTCCTTTGAGATTGGTGCATGGACG 

GAAGACGATGTGTATTGGTGGGTTCAGCAGCTCGTCAGAAAAGGTGACTCTTCAGCAGAG 

ATGAGTGTATATGCAAGCTTGTTTAAAGAAAACAACATTACAGGGAAGCGGCTGCTGCTG 

CTGGAGGAAGAAGACCTGAAAGACATGGGCATTGTCTCCAAGGGGCATATCATTCACTTC 

AAGTCAGCCATTGAGAAATTAACCCATGATTACATAAATTTGTTTCACTTCCCACCACTA 

ATTAAGGACTCAGGAGGTGAACCTGAAGAAAATGAGGAAAAAATAGTGAACCTGGAACTG 

GTTTTTGGTTTTCACTTGAAACCAGGAACTGGCCCACAGGATTGTAAGTGGAAAATGTAT 

ATGGAGATGGATGGGGATGAAATTGCAATAACCTACATAAAAGATGTGACATTCAACACT 

AACCTACCTGATGCGGAGATTTTAAAGATGACAAAGCCACCATTTGTAATGGAGAAGTGG 

ATTGTAGGAATAGCAAAAAGTCAGACTGTGGAGTGCACTGTCACATATGAGAGTGATGTT 

AGAACTCCAAAAAGCACTAAACATGTCCATTTGATTCAGTGGAGTAGAACAAAACCTCAG 

GATGAAGTGAAAGCAGTCCAACTTGCCATTCAGACATTATTCACCAATTCAGATGGCAAC 

CCTGGAAGCAGGTCCGACTCAAGTGCTGATTGCCAGTGGTTAGATACTCTGAGGATGCGG 

CAGATTGCATCCAACACTTCTTTACAGCGTTCCCAGAGCAATCCTATTCTGGGGTCACCG 

TTCTTCTCACACTTTGATGGCCAGGATTCCTACGCTGCTGCTGTGAGACGGCCCCAGGTG 

CCCATTAAGTATCAACAGATTACACCTGTGAACCAGTCCAGAAGCTCGTCTCCTACTCAG 

TATGGACTGACCAAAAACTTCTCTTCCTTACATCTCAACTCTAGGGACAGTGGCTTTTCC 

AGTGGCAATACTGACACCTCTTCAGAGAGGGGTCGATACTCAGACAGAAGCAGGAACAAA 

TATGGACGTGGTAGTATATCACTCAATTCTTCTCCTAGAGGAAGATACAGTGGAAAGAGT 

CAGCATTCCACTCCATCAAGAGGAAGATACCCTGGAAAGTTCTACAGGGTTTCTCAGTCA 

GCACTCAATCCTCACCAGTCGCCTGACTTCAAGAGAAGCCCCAGGGACCTCCACCAACCC 

AACACCATACCAGGGATGCCTTTGCACCCTGAGACTGACTCAAGAGCCAGTGAAGAGGAC 

AGCAAAGTCAGCGAAGGGGGCTGGACAAAAGTGGAATACCGGAAAAAGCCCCACAGGCCA 

TCTCCCGCCAAAACCAATAAAGAGAGAGCCAGAGGGGACCACCGTGGATGGAGAAACTTT 

TGA 

SEO ID NO: 83 AI375137 H 

ATGGGAAATTATAAATCTAGACCAACCCAAACTTGTACTGATGAATGGAAGAAAAAAGTC 
AGTGAATCATATGTTATCACAATAGAAAGATTAGAAGATGACCTGCAGATCAAGGAAAAA 
GAACTGACAGAACTAAGGAATATATTTGGCTCTGATGAAGCCTTCAGTAAAGTCAATTTA 
AATTACCGCACTGAAAATGGGCTGTCTCTACTTCATTTATGTTGCATTTGTGGAGGCAAG 
AAATCACATATTCGAACTCTTATGTTGAAAGGGCTCCGCCCATCTCGACTGACAAGAAAT 
GGATTTACAGCCTTGCATTTAGCAGTTTACAAGGATAATGCAGAATTGATCACTTCTCTG 
CTTCACAGTGGAGCTGATATACAGCAGGTTGGATACGGTGGCCTCACTGCCCTCCATATT 
GCTACAATAGCTGGCCACCTAGAGGCTGCTGATGTGCTGTTGCAACATGGAGCTAATGTC 
AATATTCAAGATGCAGTTTTTTTCACTCCATTGCATATTGCAGCGTACTATGGACATGAA 
CAGGTAACTCGCCTTCTTTTGAAATTTGGTGCTGATGTAAATGTAAGTGGTGAAGTTGGA 
GATAGACCCCTCCACCTAGCATCTGCAAAAGGATTCTTGAATATTGCAAAACTCTTGATG 
GAAGAAGGCAGCAAAGCAGATGTGAATGCTCAAGATAATGAAGACCATGTCCCACTCCAT 
TTCTGTTCTCGATTTGGACACCATGATATAGTTAAGTATCTGCTGCAAAGTGATTTGGAA 
GTTCAACCTCATGTTGTTAATATCTATGGAGATACCCCCTTACACCTGGCATGCTACAAT 
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GGCAAATTTGAAGTTGCCAAGGAAATCATCCAAATATCAGGAACAGAAAGTCTGACTAAG 

GAAAACATCTTCAGTGAAACAGCTTTTCATAGTGCTTGTACCTATGGCAAGAGCATTGAC 

CTAGTCAAATTTCTTCTTGATCAGAATGTCATAAACATCAACCACCAAGGAAGGGATGGG 

CACACTGGATTACACTCTGCTTGCTACCACGGTCACATTCGCCTGGTTCAGTTCTTACTG 

GATAATGGAGCTGATATGAATCTAGTGGCTTGTGATCCCAGCAGGTCTAGTGGTGAAAAA 

GATGAGCAGACATGTTTGATGTGGGCTTATGAAAAAGGGCATGATGCCATTGTCACACTC 

CTGAAGCATTATAAGAGACCACAAGATGAATTGCCCTGTAATGAATATTCTCAGCCTGGA 

GGAGATGGCTCCTATGTGTCTGTTCCATCACCCTTGGGGAAGATTAAAAGCATGACAAAA 

GAGAAGGCAGATATTCTCCTCCTAAGAGCTGGATTGCCTTCACATTTCCATCTTCAGCTC 

TCAGAAATTGAGTTCCATGAGATTATTGGCTCAGGTTCTTTTGGGAAAGTATATAAAGGA 

CGATGCAGAAATAAAATAGTGGCTATAAAACGTTATCGAGCCAATACCTACTGCTCCAAG 

TCAGATGTGGATATGTTTTGCCGAGAGGTGTCCATTCTCTGCCAGCTCAATCATCCCTGC 

GTAATTCAGTTTGTGGGTGCTTGCTTGAATGATCCCAGCCAGTTTGCCATTGTCACTCAA 

TACATATCAGGGGGTTCTCTGTTCTCCCTCCTTCATGAGCAGAAGAGGATTCTTGATTTG 

CAGTCTAAATTAATTATTGCAGTAGATGTTGCCAAAGGCATGGAGTACCTTCACAACCTG 

ACACAGCCAATTATACATCGTGACTTGAACAGTCACAATATTCTTCTCTATGAGGATGGG 

CATGCTGTGGTGGCAGATTTTGGAGAATCAAGATTTCTACAGTCTCTGGATGAAGACAAC 

ATGACAAAACAACCTGGGAACCTCCGTTGGATGGCTCCTGAGGTGTTCACGCAGTGCACT 

CGGTACACCATCAAAGCAGATGTCTTCAGCTATGCTCTGTGTCTGTGGGAAATTCTCACT 

GGCGAAATTCCATTCGCTCATCTCAAGCCAGCGGCTGCGGCAGCAGACATGGCTTACCAC 

CACATCAGACCTCCCATTGGCTATTCCATTCCCAAGCCCATATCATCTCTGCTGATACGA 

GGGTGGAACGCATGTCCTGAAGGAAGACCCGAATTTTCTGAAGTTGTCATGAAGTTAGAA 

GAGTGTCTCTGCAACATTGAGCTGATGTCTCCTGCATCAAGTAACAGCAGTGGGTCTCTC 

TCACCTTCTTCTTCTTCTGATTGCCTGGTGAACCGGGGAGGACCTGGCCGGAGTCATGTG 

GCAGCATTAAGAAGTCGTTTCGAATTGGAATATGCTCTAAATGCAAGGTCCTATGCTGCT 

TTGTCCCAAAGTGCTGGACAATATTCCTCTCAAGGTCTGTCTTTGGAGGAGATGAAAAGA 

AGTCTTCAATACACACCCATTGACAAATATGGCTATGTATCCGATCCCATGAGCTCAATG 

CATTTTCATTCTTGCCGAAATAGTAGCAGCTTTGAGGACAGCAGCTGA 

SEQ ID NO: 84_H97685_H 

ATGATTTCTTGCCTGTNATAACCTATGCACTCACAAAGATGAACTCTCTGAGAGGGATGA 
GCAAGAGCTTCAGGAAATCCGAAAGTATTTCTCCTTTCCTGTATTCTTTTTCAAAGTGCC 
GAAACTGGGCTCGGAGATAATAGACTCCTCAACCAGGAGAATGGAGAGCGAAAGATCACC 
GCTTTATCGCCAGCTAATTGACCTGGGCTATCTGAGCAGCAGTCACTGGAACTGTGGGGC 
TCCTGGCCAGGATACTAAAGCTCAGAGCATGTTGGTGGAACAGAGTGAAAAGCTGAGACA 
CTTGAGCACATTTTCTCACCAGGTGTTACAGACTCGCCTGGTGGATGCAGCCAAGGCCCT 
GAACCTGGTGCACTGCCACTGCCTTGACATCTTTATTAACCAGGCATTTGACATGCAGCG 
GGACCTGCAGATCACTCCCAAACGTCTGGAATATACTCGAAAAAAGGAGAATGAGTTGTA 
TGAATCATTGATGAATATTGCCAACCGAAAGCAGGAGGAAATGAAGGATATGATTGTTGA 
GACACTTAATACCATGAAGGAGGAACTTCTGGATGATGCTACTAACATGGAGTTTAAAGA 
CGTCATTGTCCCTGAGAATGGAGAACCAGTAGGCACCAGAGAGATCAAATGCTGCATCCG 
ACAGATCCAGGAACTCATCATCTCCCGACTTAATCAGGCAGTGGCTAATAAGCTGATCAG 
CTCAGTGGATTACCTGAGGGAAAGCTTCGTCGGAACCCTGGAACGATGTCTGCAGAGCCT 
GGAGAAGTCTCAGGATGTCTCAGTTCACATCACCAGTAATTATCTCAAACAGATCTTAAA 
TGCTGCCTATCATGTTGAAGTCACGTTTCACTCAGGGTCGTCAGTTACAAGGATGCTATG 
GGAGCAAATCAAACAGATCATCCAGCGCATCACATGGGTGAGCCCACCTGCCATCACTCT 
GGAATGGAAGAGGAAGGTGGCCCAGGAAGCCATTGAGAGCCTCAGCGCCTCCAAATTGGC 
TAAGAGCATTTGCAGCCAATTCCGGACTCGGCTCAATAGTTCCCACGAGGCTTTTGCAGC 
CTCCTTGCGGCAGCTGGAAGCTGGCCACTCAGGCCGGTTAGAGAAAACGGAAGATCTATG 
GCTGAGGGTTCGGAAAGATCATGCTCCCCGCCTGGCCCGCCTTTCTCTGGAAAGCCGTTC 
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FIGURE 2PPP 

TTTACAGGATGTCTTGCTTCATCGTAAACCTAAACTGGGACAGGAACTGGGCCGGGGCCA 
GTATGGTGTGGTATACCTGTGTGACAACTGGGGAGGACACTTCCCTTGTGCCCTCAAATC 
AGTTGTCCCTCCAGATGAGAAGCACTGGAATGATCTGGCTTTGGAATTTCACTATATGAG 
GTCTCTGCCGAAGCATGAGCGATTGGTGGATCTCCATGGTTCAGTCATTGACTACAACTA 
TGGTGGTGGCTCCAGCATTGCTGTGCTCCTCATTATGGAGCGGCTACACCGGGATCTCTA 
CACAGGGCTGAAGGCTGGGCTGACCCTGGAGACACGTTTGCAGATAGCACTAGATGTGGT 
GGAGGGAATCCGCTTCCTGCACAGCCAGGGACTTGTCCATCGTGATATCAAACTGAAAAA 
TGTGCTGCTGGATAAGCAGAACCGTGCCAAGATCACTGACTTAGGATTCTGCAAGCCAGA 
GGCCATGATGTCAGGCAGCATTGTGGGGACACCAATCCATATGGCCCCTGAACTTTTCAC 
AGGGAAGTACGATAATTCCGTGGATGTCTACGCTTTTGGAATTCTTTTCTGGTATATCTG 
CTCAGGCTCTGTCAAGCTCCCTGAGGCATTTGAGAGGTGTGCTAGCAAAGACCATCTCTG 
GAACAATGTGCGGAGGGGGGCTCGCCCAGAACGTCTTCCTGTGTTTGATGAGGAGTGCTG 
GCAGTTGATGGAAGCCTGTTGGGATGGCGACCCCTTGAAGAGGCCTCTCTTGGGCATTGT 
CCAGCCCATGCTCCAGGGCATCATGAATCGGCTCTGCAAGTCCAATTCTGAGCAGCCAAA 
CAGAGGACTAGATGATTCTACTTGAAAGCAAAGACCTTTCTCTTTCACTCTCTAGTTATT 
TCCTTCCCCCTCACCATTTGGCCATGGGGAGAATTTGACATTTATTCACTATAGGACACA 
CTCCCAAGGGAACTGGTGCTTGCTGGGAAACTTGGAACCTTCCCAGGCAGGGATGACTCC 
TGGACAGTGAAGAGTTGAATGACTGAGCATATTCAGCAGCTCACTGAAGCGCCAAGCTAT 
CCCTTTAGCAAAAAAGTGTCTCAGATGTGTAAAAGCTGAGGAATGTGGTGTTCTGGCTTC 
ACAAATGAAAAGGAGGCAGATGTT 

SEQ ID NO: 85__W2 0810_M 

TTGATGTCAACCTGAAGGCTTCTAAAGCGAGTGATGTCTACAGCTTTGGGATCCTCGTGT 
GGGCAGTGCTGGCTGGCAGAGT^AGCTGAGTTGGTAGACAAGACTTCACTAATCCGGGAAA 
CAGTGTGTGACAGGCAGAGTCGTCCTCCACTGACAGAGCTGCCTCCAGGTAGCCCTGAGA 
CTCCCGGCTTGGAAAAACTGAAGGAGTTAATGATTCATTGCTGGGGTTCCCAGTCCGAAA 
ACAGGCCATCCTTCCAGGACTGCGAACCAAAAACCAATGAAGTTTACAATCTGGTAAAGG 
ACAAGGTAGATGCTGCTGTCTCCGAGGTAAAGCATTATCTGTCTCAGCACAGT^AGCAGCG 
GCAGAAACTTGTCTGCCAGAGAGCCAAGCCAAAGAGGCACAGAAATGGATTGCCCGAGGG 
AAACCATGGTTTCTAAAATGCTGGACCGCCTGCATTTGGAGGAACCCTCCGGACCAGTTC 
CTGGAAAATGTCCTGAGAGGCAAGCACAGGACACATCAGTTGGGCCTGCCACACCAGCAA 
GGACATCTTCTGACCCCGTGGCTGGCACTCCTCAGATTCCACATACTTTACCCTTCAGAG 
GCACAACACCTGGGCCAGTCTTTACTGAGACTCCCGGTCCTCACCCCCAAAGGAATCAGG 
GAGATGGAAGACACGGCACTCCTTGGTATCCCTGGACCCCACCGAATCCAATGACAGGGC 
CACCGGCTCTCGTCTTCAACAACTGTTCTGAAGTGCAGATTGGGAACTACAACTCCTTGG 
TAGCACCACCAAGAACTACTGCCTCAAGTTCGGCCAAGTATGACCAAGCACAGTTCGGCA 
GGGGTAGGGGCTGGCAGCCCTTCCACAAGTAGACTTCAGAGAATCACTGCAAGAGCCTGA 
AGTGTGCCATTCAGCGTGGCAATAAAAAGCACGTTTTAAGCAACCTGGACTGGCTAAGAC 
AGTCCTTGCCACTTCCTGAAGCTCACAACATTCTGTGAGGACAGTTGGACCTACACCCAA 
ACTGACTCTTGACCCATCTCCTTAAAGTCAATAAACATAGCATGTTAACTGTG 

SEQ ID NO: 86_AA74423 6_H 

ATGGGATCAGAGAACAGTGCTTTAAAGAGCTATACACTGAGAGAACCACCATTTACCTTA 
CCCTCTGGACTTGCTGTTTATCCCGCTGTACTGCAAGATGGCAAATTTGCTTCAGTTTTT 
GTGTATAAGAGAGAAAATGAAGACAAGGTTAATAAAGCTGCCAAGCATTTG7^AGACACTT 
CGTCACCCTTGCTTGCTAAGATTTTTATCTTGTACTGTGGAAGCGGATGGCATTCATCTT 
GTCACTGAGCGAGTACAGCCCCTGGAAGTGGCTTTGGAAACATTGTCTTCTGCAGAGGTC 
TGTGCTGGGATCTATGACATATTGCTGGCTCTTATCTTCCTTCATGACAGAGGACACCTA 
ACACACAATAATGTCTGTTTATCATCTGTGTTTGTGAGTGAAGATGGACACTGGAAGCTA 
GGAGGAATGGAAACTGTTTGTAAAGTTTCTCAGGCCACACCAGAGTTTCTGAGGAGTATT 
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FIGURE 2QQQ 

CAGTCAATAAGAGACCCAGCATCTATCCCTCCTG^GAGATC^TCCAG^TT^C^CT 
CTCGCAGAGTGTCATGGACATGCCCGGGATG^ 

TTGCTCACAATC^A^TGMCA^T^CAG^GMGTTCTCTC^^^^^^^^^ 
TTGCACTCMCTTTGCTGAATCCCACTCCAAAA1G1L , GAAAAGTTTAAC A 

GCGCAGGGAGAAACTCCTTGCTTGCTCTCACCAGCC 

TTT ^aaa?™^ G ttc™g?Sgag« 

GACAGTGAAAACTTCCCATCAAt3iiui^«« GCC TAGAGAACCTTGTGAT 
CCTGAGGAGCCTGAAAATCAAACTGTCAACATACAGATT^ 

GATGTCAAGTCOCAGTGCACTACCTTGGATGTGG 

CCCAGCAGCTTAGATACTAAAGTAAACCC^ 

ACCTCAGGGGAGCAGAAGCCTATTCCTGC^ 

TGGAAATCAAGCTTACCCCAAA^GATTAGCGTTGTA^ 

ATCGAGCCGCCAAAAGTGTCATCACAAGAAA^GCCCCTT^ 

TTAGGAGAGGAATTCACCATTCAAGTAAAAAAGAAGCCAGTA^^ 

==Es=iS=== 

CTGAACTGGGAAGATAATAACTGGTGA 

SEQ ID NO: 87_AI052250 H AGCC AGAGTCAGTGGCCAGGCACGAA 
AGCGGCCGCGGGGGCGGCGGAG^GATATGGAGT 

GGCAGAGCAGGAACAGCCAGGAGGCGTTTATIA^ AT TGCAGCCATGGAGT 

CCGCCCCTCCTGGAAGAAGGAAGAGGTAACTATAACTAC^^ 

CCATGCTTAATAAATTGAAGAGTACTGTTACAAAAGTCACAG^ 

TAATGGGAATTCCTGTCACTAGAG^ 
ATGGGCTAGCTTGGAAGATTTTTAATGG^ 
TTTTTGTCTTTGATAAAAAACTGATTGACAACT 
TTGATTCTCTAAAACGAGGAGTCCAACAGTTAACTCGGCTT 

====== 

GGAAACCTATATTTGAAGTCAACAAGCAAGAT^ 



PCTAJS00/14842 

WO 00/73469 
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^^^^ 



)2 III) 
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FIGURE 2SSS 

TGGAGGCCAAACGCGCCGA^GGAAGGT^CCAAGGGCCC^TGW^^^^^^^^ 

^^^^^^^^^^ 
ATCAGAGCCACAATAAATTCTATTTCAC 

SEQ ID NO: 8 9_AA599286 H CTGCTGGACG ACACGGTGCCGCTG 
ATGGCCTTCATGGAGAAGCCGCCAGCCGG^ 

ACAGCAGCCATGGAGGCGAGCCAGAGC^ 

CAAGGAGGAATTTCTGTGGAAAACA^^^ 

TTGCTTAACAACAGCTTACAGATTG^ 

ATTGGTAACATGGATCGTGAATTCATAGCTG^^ 

AACGTGATCACAACAAATCATATCTTGTCTAATTG^ 

CCAAACAACTATTCCGCAAACTATACTGAGATTGC 

CGATCAGAGCCAAAGTGGGAGGTGGTGG^ 

AAGAAATATTTGTTGATGAAGATTAAA^TCAGCC^GGAA TCAAACTT 
GCTGACCTTGGCCCAGACAAGTATTTGTCAGATAA^G^ATTTTC 

ctgccttcttgtttgcacccttacatc™^ 

TTACTGTATGAAATGACTTATGGACGACCGCCAGACr^ TCTTGTGA AGCCTGT 

^s=iiSSiib==s-G 

cgaagactgacaagagctcagtcccac^^ 
attttagctcgaaagaagtcaaaacgatctot^^ 

aagtacagcaactccaataattcag^ 

TCATCGCCAACTCCACCCTCTACATCAGGGATATCTGCATrA A GCTCTCG 

CCACCACCACCAGCAGCTCCCTTGC™ 
TCTCAGGCTGTGAATGGCATGAGCCGAGGGGCCTTGCT 

AAAGGAACTTTGAGGAAAGCCAAACCTG^ 
TCCTGTTTACACTTGGAGGGAAAAGTTCOT 

TGGCATGCAAAAAAAAAAAAAAAAAAAAA 

SEQ ID NO: 90_AA425725 H GCGGCAGCGGC GGCAGTAGCAGCAGC 

ATGAGCGCCAGCACGGGCGGTGGTG^ 

TCACAGGCCTCCTGCGGGCCCGAGTCCTCGGGCTCCG 

CCTCAGATGCTGCAGGGCCTTCTGGGCTCCGACGACGA^ 
TACTGCAAGGGCGGCTACCACCCTGTGAAG^ 

gtggtgcgcaaactgggctggggcca™^ 

CGCAAGCGCTTTGTGGCCCTCAAAGTGGTC 
G f G f C S^ 
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FIGURE 2TTT 



TGCATGGTGCTGGAGGTGCTGGGCCACCAGCTCCTCAAATGGATCATCAAGTCCAACTAC 
CAGGGCCTGCCCGTGCCCTGCGTGAAGAGCATCGTGAGGCAGGTGCTGCACGGCCTGGAC 
TACCTCCACACCAAGTGCAAGATCATCCACACGGACATCAAGCCCGAGAACATCTTGCTG 
TGTGTGGGGGACGCTTACATCAGGCGCCTGGCTGCCGAGGCCACGGAGTGGCAACAGGCA 
GGGGCGCCGCCCCCCTCCCGCTCCATAGTCAGCACTGCCCCCCAGGAGGTCTTGACCGGT 
AAGCTGTCCAAAAACAAGAGGAAGAAGATGAGGCGCAAACGGAAACAGCAGAAGCGGCTG 
CTGGAGGAGCGGCTGCGGGACCTGCAGAGGCTGGAGGCCATGGAGGCTGCCACCCAGGCT 
GAGGACTCTGGCTTGAGACTAGACGGGGGCAGCGGCTCCACATCCTCTTCAGGCTTCTCC 
GGCTCCCTCTTCTCTCCTGCCTCCTGCTCCATCCTCTCCGGCTCGTCCAATCAGCGAGAG 
ACCGGGGGCCTCCTGTCGCCTAGCACACCATTCGGTGCCTCGAACCTCCTGGTGAACCCC 
CTGGAGCCCCAAAATGCAGATAAGATCAAGATCAAGATCGCAGACCTGGGCAACGCCTGC 
TGGGTGCACAAGCACTTCACGGAAGACATCCAGACTCGGCAGTACCGGGCCGTCGAGGTG 
CTGATCGGCGCCGAATACGGCCCCCCGGCAGACATCTGGAGCACAGCCTGCATGGCCTTC 
GAGCTGGCCACTGGTGACTACCTGTTCGAGCCGCATTCTGGAGAAGACTACAGTCGTGAT 
GAGGACCACATCGCTCACATAGTGGAGCTTCTGGGGGACATCCCCCCAGCCTTCGCCCTC 
TCAGGCCGCTATTCCCGGGAGTTCTTCAACCGGAGAGGAGAGCTGCGGCACATCCACAAT 
CTCAAGCACTGGGGCCTGTACGAGGTACTCATGGAAAAGTACGAGTGGCCCCTAGAGCAG 
GCCACACAGTTCAGCGCCTTTCTGCTGCCCATGATGGAGTACATCCCCGAAAAGCGGGCC 
AGTGCCGCTGACTGCCTCCAGCACCCCTGGCTCAACCCCTAG 

SEQ ID NO: 91_SGK022_H 

TCTGGCCCTGTCCCTCCCCACCACCCGCCGCTGTGTCCAGACAGAGAATGTTCTAACGCT 
GGGGGCGGCTGCGGATGAAGTCCTTGGGGAGAAAAGGAGCAGGCCAAGGGCGATGGTGGA 
GTAGAGCTGCCTCTCAGAGGCAGCATGAGCTGAGAGGGTGATAGGAAGGCGGCGCTAGAC 
AGCATGGAGGACTTTCTGCTCTCCAATGGGTACCAGCTGGGCAAGACCATTGGGGAAGGG 
ACCTACTCAAAAGTCAAAGAAGCATTTTCCAAAAAACACCAAAGAAAAGTGGCAATTAAA 
GTTATAGACAAGATGGGAGGGCCATCAGAGTTTATCCAGAGATTCCTCCCTCGGGAGCTC 
CAAATCGTCCGTACCCTGGACCACAAGAACATCATCCAGGTGTATGAGATGCTGGAGTCT 
GCCGACGGGAAAATCTGCCTGGTGATGGAGCTCGCTGAGGGAGGGGATGTCTTTGACTGC 
GTGCTGAATGGGGGGCCACTGCCTGAAAGCCGGGCCAAGGCCCTCTTCCGTCAGATGGTT 
GAGGCCATCCGCTACTGCCATGGCTGTGGTGTGGCCCACCGGGACCTCAAATGTGAGAAC 
GCCTTGTTGCAGGGCTTCAACCTGAAGCTGACTGACTTTGGCTTTGCCAAGGTGTTGCCC 
AAGTCACACCGGGAGCTGAGCCAGACCTTCTGCGGCAGTACAGCCTATGCTGCCCCCGAG 
GTGCTGCAGGGCATTCCCCACGATAGCAAAAAAGGTGATGTCTGGAGCATGGGTGTGGTC 
CTGTATGTCATGCTCTGTGCCAGCCTACCTTTTGACGACACAGACATCCCCAAGATGCTG 
TGGCAGCAGCAGJVAGGGGGTGTCCTTCCCCACTCATCTGAGCATCTCGGCCGATTGCCAG 
GACCTGCTCAAGAGGCTCCTGGAACCCGATATGATCCTCCGGCCTTCAATTGAAGAAGTT 
AGTTGGCATCCATGGCTAGCAAGCACTTGATAAAAGCAATGGCAAGTGCTCTCCAATAAA 
GTAGGGGGAGAAAGCAAA 

SEQ ID NO: 92__AA06002 6_M SGK022_M 

CAGACGGAGAATGTTCTAGCCCTGGAGGCAGCTGTGAATGAAGTCCTTGGGGGGAAAAGA 
AGCAGGCCGAGGGCGATGGTGGAGTAGAGCTGCCTCGCAGAGGCAGCATGAGCTGAGAGG 
GTGACAAGAAGGAGGCGCTACACAGCATGGAGGACTTTCTACTCTCCAATGGGTATCAGC 
TGGGCAAGACCATTGGGGAAGGGACCTACTCAAAAGTCAAAGAAGCATTTTCCAAAAAAC 
ATCAAAGAAAAGTGGCAATTAAAATTATAGACAAGATGGGAGGGCCAGAAGAGTTTATCC 
AGAGATTCCTGCCTCGTGAGCTCCAGATTGTCCGTACCCTGGACCACAAAAACATCATCC 
AGGTGTATGAGATGCTGGAGTCAGCAGATGGAAAAATCTACCTGGTGATGGAACTGGCTG 
AGGGAGGGGATGTCTTTGACTGTGTGCTGAACGGAGGGCCACTTCCCGAGAGCCGGGCCA 
AGGCCCTCTTCCGCCAGATGGTTGAGGCTATTCGCTATTGCCATGGCTGTGGCGTGGCCC 
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FIGURE 2UUU 

ACCGGGACCTTAAGTGTGAGAACGCCTTGTTGCAGGGCTTCAACCTGAAGCTGACCGACT 
TTGGCTTTGCCAAGGTGCTACCCAAGTCACGCAGGGAGCTGAGCCAGACCTTCTGTGGCA 
GCACAGCCTATGCCGCCCCTGAGGTGCTACAGGGCATACCCCATGATAGCAAGAAAGGTG 
ATGTCTGGAGCATGGGTGTGGTCCTGTATGTAATGCTCTGTGCAAGTCTACCTTTTGATG 
ACACAGATATCCCCAAGATGCTGTGGCAGCAGCAGAAGGGGGTGTCCTTCCCCACTCATT 
TGGGCATCTCAACCGAATGCCAGGACCTGCTCAAGCGGCTCCTGGAACCAGACATGATAC 
TCCGGCCTTCAATCGAAGAAGTTAGTTGGCACCCATGGCTAGCAAGCACTTGATAAAAGC 
AATGGCAAGTCCTCCCCAATAAAGTAGGGGGAGAAAGCAAACTG 

SEQ ID NO: 93_AA399669_H 

CTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACCGCGCCCGGCCGCACTTCATTCTCAA 

GTTTTGTGGCCAACGATGGATAGGAGGTGGATTGTGATGTATTCGGAACATGGGACCTTG 

AGGAGTTCCGTAACCAAAAGGAGAAAGTAACAACAGCCAGTGGAGACAAAAAGAACTGCT 

TCTCTTTCTTTCCCCCTCCAAGTTCCTAGTGGAGGGCTGAGTCCAGCATCCCAGACTCGT 

GTGACTATATAGGCAAGCATTTGGGGACCTACTTCACTTTGATACCCTAGCCTTCAGCAG 

CTCAAGGTGTTGGCCTTTGGATAGGAGGCTTCCAAGTAGTAAAGCTCCCTGCTCTCAGCA 

AGCCCAACACCATGGGGAAGGGAGATGTCTTAGAGGCAGCACCAACCACCACAGCCTACC 

ATTCCCTCATGGATGAATATGGTTATGAGGTGGGCAAGGCCATTGGCCATGGCTCCTATG 

GGTCGGTATATGAGGCTTTCTACACAAAGCAGAAGGTTATGGTGGCAGTCAAGATCATCT 

CAAAGAAGAAGGCCTCTGATGACTATCTTAACAAGTTCCTGCCCCGTGAAATACAGGTAA 

TGAAAGTCTTGCGGCACAAGTACCTCATCAACTTCTATCGGGCCATTGAGAGCACATCTC 

GAGTATACATCATTCTGGAACTGGCTCAGGGTGGTGATGTCCTTGAATGGATCCAGCGCT 

ACGGGGCCTGCTCTGAGCCCCTTGCTGGCAAGTGGTTCTCCCAGCTGACCCTGGGCATTG 

CCTACCTGCACAGCAAGAGCATCGTGCACCGGGACTTAAAGTTGGAGAACCTGTTGCTGG 

ACAAGTGGGAGAATGTGAAGATATCAGACTTTGGCTTTGCCAAGATGGTGCCTTCTAACC 

AGCCTGTGGGTTGTAGCCCTKCTTACCGCCAAGTGAACTGCTTTTCCCACCTCAGCCAGA 

CTTACTGTGGCAGCTTTGCTTACGCTTGCCCAGAGATCTTACGAGGCTTGCCCTACAACC 

CTTTCCTGTCTGACACCTGGAGCATGGGCGTCATCCTTTACACTCTAGTGGTCGCCCATC 

TGCCCTTTGATGACACCAATCTCAAAAAGCTGCTAAGAGAGACTCAGAAGGAGGTCACTT 

TCCCAGCTAACCATACCATCTCCCAGGAGTGCAAGGTCCAACTGCTCATTGCCTGTGTGG 

CACAATGGAGAAAAACTCAGGCAAGACCTCTCTCTCCCCTGCTCTAGAACCTGATCCTCC 

AGATGCTACGCCAAGCCACTAAGCGTGCCACCATTCTGGACATCATCAAGGATTCCTGGG 

TGCTCAAGTTCCAGCCTGAGCAACCCACCCATGAGATCAGGCTGCTTGAGGCCATGTGCC 

AGCTCCACAACACCACTAAACAGCACCAATCCTTGCAAATTACGACCTGAAAATGGCTGA 

GGGAGGGGGCTAAGAGAGGAGCAAAGCAGGAGGTCTTGGGCTAAAAATCTTTTTTACCAA 

AAATAAATCTAAGTCTGATTTAGTTTCATCAAAAAAA 
SEQ ID NO: 94_AA758539_H 

GACCATTCAGACGCCTCCGGTAGTGTAAATGAGGACAATGCCTGCTGGCCCACATGACGG 
GGGGATGTAGACGGCAGCGGCGCCAGTCGCTCCTGGCACCATGGACGATGCCACAGTCCT 
AAGGAAGAAGGGTTACATCGTAGGCATCAATCTTGGCAAGGGTTCCTACGCAAAAGTCAA 
ATCTGCCTACTCTGAGCGCCTCAAGTTCAATGTGGCTGTCAAGATCATCGACCGCAGGAA 
AACACCTACTGACTTTGTGGAGAGATTCCTTCCTCGGGAGATGGACATCCTGGCAACTGT 
CAACCACGGCTCCATCATCAAGACTTACGAGATCTTTGAGACCTCTGACGGACGGATCTA 
CATCATCATGGAGCTTGGCGTCCAGGGCGACCTCCTCGAGTTCATCAAGTGCCAGGGAGC 
CCTGCATGAGGACGTGGCACGCAAGATGTTCCGACAGCTCTCCTCCGCCGTCAAGTACTG 
CCACGACCTGGACATCGTCCACCGGGACCTCAAGTGCGAGAACCTTCTCCTCGACAAGGA 
CTTCAACATCAAGCTGTCTGACTTTGGCTTCTCCAAGCGCTGCCTGCGGGACAGCAATGG 
GCGCATCATCCTCAGCAAGACCTTCTGCGGGTCGGCAGCATATGCAGCCCCCGAGGTGCT 
GCAGAGCATCCCCTACCAGCCCAAGGTGTATGACATCTGGAGCCTGGGCGTGATCCTGTA 
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rATPATGGTCTGCGGCTCCATGCCCTATGACGACTCCGACATCAGGAAGATGCTGCGTAT 
rrarrRCCGGCCCGACCACAAGCTTGGAGCCAAAACCCAGCACCGGCTGCTGGTGGTGCC 

TT 



^^^"Sgaactcggtt^gctg^ccgcacaatt^qag 

GGC 



TArCTACTCCAAGGTGAAGGTGGCCACATCCAAGAAGTACAAGGGTACCGTGGCCATC 
AAGGTGGTGGACC^GCGGCGAGCGCCC 



rTGTCCATCCTGCGGGGCGTGCGACACCCGCACATCGTGCACGTCTTCGAGTTCATCGAG 
GTGTGCAACGGGAAACTGTACATCGTGATGGAAGCGGCCGCCACCGACCTGCTGCA^ 

??gca^ca!cgggcgcatccccggagttcaggcgcgcgacctctttgcgcagatcgcc 

GGCGCCG?GC^C?AC^ 

Sg?tc?tgagcccggacgagcgccgcgtcaagctcaccgac 

gc?S?ggSacccagL 

gaggtgScSSgcatcccctacgaccccaa^^ 

gtcSctIcgtcatggtcaccgggtgcatgcccttcgac 

^rrr^GcSGAAACGCGGCGTGCTCTATCCCGAAGGCCTCGAGCTGTCCGAGCGCTGC 

GT. 



AGCGCGCAACTGCTGGCTGCGCGCCGGGGACTCCGGCTAG 



SggtLagaacaggIgc^ 
tca?Saaa^ 

GGACAAA^AGGCCTTATAATTTGTGATTCTGTGGCTTTGTCTAAAAGTCCATAAAG^ 

???g™tccag5ctcacagactgctcacaacagtccacaaggctggtggggag^ 
™aatcat^actaacgacaaaaataatagaagtgaacattctt 

Sgcta^Icacacttaagaccattaagaaagc 

agSotgga^ggcaggagacaggtgtgaggaggtgggcct 

??ctotccottacttcctcagagtttatccagagattcctccctcgggagctcc^ 

rCTCCGTACCCTGGACCACAAGAACATCATCCAGGTGTATGAGATGCTGGAGTCTG 

cgggaaa^?Sgc??ggtgatggagctcgctgagggaggggatg^ 

rAATGG^GGGCCACTGCCTGAAAGCCGGGCCAAGGCCCTCTTCCGTCAGATGGTTGAG^ 
G^GCAGGGOT^ 

ararrrrGAGCTGAGCCAGACCTTCTGCGGCAGTACAGCCTATGCTGCCCCCGAGGTGCT 

?Sgagca?Scg^cgattgccaggacctg 

cSccggcStcaattgaagaagt™ 
gcaatggcaagtgctctccaataaagtagggggagaaagcaaaccc 
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SEQ ID NO: 97_H29974_H oor , r ,_. or ™ rrrrGTG GTTTATGAGGCAGTGGC 
TTACAGCCTGTTGGCGGAGATCGGGCGCGGCAGCTACGGCGT 

GCTACAGCTGACGAGCGCCATTGCCTTCCTG^ 

COTATCATCTG^CAATGATA^^^mTATTGACTC GCGCTGCT 
GAGGTGAAGAACCGCC 

SEQ ID NO: 9 8 -^ 49 «i°^ "' G ?^T^AAACGGGGAAAGAGAACGGAGCCCGC 
CCGTTGCTGCTCCCCCCGCCCCCCGCA^ 

AGAGGGACAAAAAGCCCGGAGCGGAAAA 

AGCCTCTTGGCGGAGATCGGGCGCGGCAGCTACGGCGTGGT^ 
CGCAGTGGGGCCAGGGTGGCAGTCAAGAAGATCCGCTCCG^ 

TTGGCACTAGCAGAATTCTGGGCCCTCACCACT 

CAGTTTGAGGAGTGCGTCCTACAGCGCAACGGG^ 

AAGAACTCACAGCTTTACCTGCGCCTG^ 

GGCTATGCTGAGGAGCCCTGCTATCTCTGGTT^ 

CTCAATCAGTATGTCCTGTCCCGGAGAOT 

CAGCTTACAAGCGCCATTGCCTTCCTC^^ 
CCAGACAACATCCTGATCACAGAGCGGTCTGGCACCCCC 

GGACTGAGCAAGGTCTGTGCAGGGCTGGCACCCC^ 
AAAAATGTGAATGTGAATAAATACTGGCTGTCCTCAG 

GCTCCCGAAGTCTGGGAGGGACACT^ 
ATCATCTGGGCAATGATAGAAAGAATTACCTT^ 
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AAGCAGCTCTTGAAAGACATGTTAGCTGCTAACCCACAGGACCGACCTGATGCTTTTGAA 
CTTGAAACCCGAATGGACCAGGTCACATGTGCTGCTTAAACTCCAGGGCTGAACGTCTTG 
GGTGTTTTTAAACTAGGTCGATCCTTCGGGACCCACAGTCTCATCGTGTCTCGGACAGGA 
TGGCAGAGGGTACAGGTGGTGGTGATCTCCTGACAGCTGGACCTCCCACAATGTGAAGCT 
CACGCTTGGGCTGCCCACTCTACCCTTCTCTTTCTCCTTCAGTAGAATAATAATTGTTTT 
TCTAAACATTAAACCATCAAGACTTCTGAAGAGCAGAAGGCTACACTCTG 

SEQ ID NO: 99_AA215311_H 

CGRCCGCGCTACGGAAAGCCGGAGGGGGGCGGGGCCGTCGGCGTAAGGGGGTGTGTCCGC 

GCGCACCACGGGGGCGCGCGCCGGCTGCTGACTGGAGGCGGCGGCAGCGGAGGCGCGAGC 

TGCCCGATAATGGCGGCCTGCAGAGCCCATGAGAGGGAGAAGCGGCAGCGTCTACCCTGA 

GAAACCTCGACCTTGAAGATGGTGAGTAGCCAGCCAAAGTACGATCTAATACGGGAGGTA 

GGCCGAGGTAGTTACGGTGTTGTGTATGAAGCAGTCATCAGAAAGACCTCTGCACGGGTG 

GCAGTGAAGAAAATTCGATGTCACGCACCTGAAAATGTTGAACTAGCCCTTCGTGAGTTC 

TGGGCACTAAGCAGTATCAAGAGCCAACATCCAAATGTGATTCACTTGGAGGAATGCATC 

CTACAAAAGGATGGGATGGTGCAAAAGATGTCCCACGGCTCTAATTCTTCCCTTTATTTA 

CAGCTTGTAGAAACTTCATTAAAAGGAGAAATTGCCTTTGATCCCAGAAGCGCCTATTAT 

TTGTGGTTTGTGATGGATTTTTGTGACGGAGGAGATATGAATGAGTATCTGTTGTCCAGG 

AAGCCCAATCGTAAAACTAACACCAGCTTCATGCTTCAGCTGAGCAGTGCCCTGGCTTTC 

TTGCATAAAAACCAGATCATCCACCGAGATCTTAAGCCTGATAACATCCTGATTTCTCAA 

ACCAGGTTGGATACCAGTGACTTGGAACCTACCCTCAAAGTGGCTGATTTTGGTCTAAGT 

AAAGTTTGTTCAGCCTCTGGGCAGAACCCAGAAGAACCTGTCAGTGTAAACAAGTGTTTC 

CTTTCCACAGCATGTGGAACAGATTTTTACATGGCTCCTGAAGTTTGGGAAGGACATTAC 

ACAGCAAAAGCTGACATCTTTGCTCTGGGGATTATCATCTGGGCAATGCTGGAAAGGATC 

ACATTCATAGACACAGAGACAAAGAAGGAACTCTTGGGGAGTTATGTAAAACAAGGAACT 

GAGATTGTGCCTGTTGGGGAGGCACTTCTGGAAAATCCCAAAATGGAACTTCTCATTCCT 

GTGAAGAAAAAATCTATGAATGGGCGAATGAAACAACTGATTAAGGAAATGCTGGCTGCA 

AACCCTCAGGATCGTCCAGATGCTTTTGAACTAGAACTCAGATTAGTACAAATTGCATTT 

AAAGATAGCAGCTGGGAAACGTGACACATATTATTTGCAAATACCATGGATGATATGCTG 

CTTCTGTTTAACAGTGATGCAACATTATGTGGCTGAAAAAGAATATAAAAAGCTAGACTC 

TACCCTCTAAGGGTTTAGATTTTTTGTGGGATTTTTTTTTTCCTCATTTTTCTTAAATCC 

AAGTTGGCCGTTTTATTAGTATGTTTCAAATGTGTATTACCAATGTGGGTGTAAATTTTT 

AAAAAATGATTATTGATAGAAGTTTGGCAGGAAAATTCTTTAAGAGCTAACAAGAGAAGA 

GAGTCCAGTTTTCTGGAAATATGTCTTTAAGTATTTTAGACATTCCTCGTCAGTATTAGG 

AATTTCCATGGGAAAAGAGGTTTGCATGCTGGTAATGCAACCTTTGAAACTTTGTAAAGG 

AAACATATATGTATATATTTATGTATATGTAAGTATGTGAATGTGCGCATTTTGCATTCC 

ATATGAAAAAAATGCCACGTCTGTTTAAATTATTTGATGTAGGTTTGGGTTTTTGAGATT 

TGCTGGTGAAGTCAGTGACGAAAAATAAACCTTCCCTTATCTTCCTACTCTGCCCCTCCC 

CCTAATGAAATCATATTAAGTNGTTTTTCCTNNTTTTTTTGTAATATACAGCTTTTTTTT 

TAAGGCATCATTTTCGAGGGTCTAAAATTATCTGGTAAAACAAATGAAATTAAGTGATCC 

AAAGCTGCTGAAGTATGTTTGAACTCTCCAGTGCCCTATAGCTGCAAGAGTTGAATTAGT 

CATGCAGTCATATGGCAGCAGGTTGGTGATT 

SEQ ID NO: 100_AA018361_H 

GCGGGGCCTCCGTATCCCCACGTGGGCCCTGCAGGAACTGGCGGGGCGCGTGACCCGGCG 
AGGCCCAGAGACAGGGGAGGGGCGCCGGGAGCCGGGCGGATCCGCGTCCCCGATGCGCGC 
TGCATTTCCGGCGGGCGGCGCTGGGGGCAGCGTGGAGCCACCCAGTGCTCGGCCCGCCCC 
GCAACCCGCCGGAACCGCCGCCCGCAGCGAGGAAGCGCCCGCGCGGGCGCAGGCGGCCGG 
AATGGCGGGGCCCGGCTGGGGTCCCCCGCGCCTGGACGGCTTCATCCTCACCGAGCGCCT 
GGGCAGCGGCACGTACGCCACGGTGTACAAGGCCTACGCCAAGAAGGACACTCGTGAAGT 
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GGTAGCCATAAAGTGTGTAGCCAAGAAAAGTCTGAACAAGGCATCGGTGGAGAACCTCCT 
CACGGAGATTGAGATCCTCAAGGGCATTCGACATCCCCACATTGTGCAGCTGAAAGACTT 
TCAGTGGGACAGTGACAATATCTACCTCATCATGGAGTTTTGCGCAGGGGGCGACCTGTC 
TCGCTTCATCCATACCCGCAGGATTCTGCCTGAGAAGGTGGCGCGTGTCTTCATGCAGCA 
ATTAGCTAGCGCCCTGCAATTCCTGCATGAACGGAATATCTCTCACCTGGATCTGAAGCC 
ACAGAACATTCTACTGAGCTCCTTGGAGAAGCCCCACCTAAAACTGGCAGACTTTGGTTT 
CGCACAACACATGTCCCCGTGGGATGAGAAGCACGTGCTCCGTGGCTCCCCCCTCTACAT 
GGCCCCCGAGATGGTGTGCCAGCGGCAGTATGACGCCCGCGTGGACCTCTGGTCCATGGG 
GGTCATCCTGTATGAAGCCCTCTTCGGGCAGCCCCCCTTTGCCTCCAGGTCGTTCTCGGA 
GCTGGAAGAGAAGATCCGTAGCAACCGGGTCATCGAGCTCCCCTTGCGGCCCCTGCTCTC 
CCGAGACTGCCGGGACCTACTGCAGCGGCTCCTGGAGCGGGACCCCAGCCGTCGCATCTC 
CTTCCAGGACTTCTTTGCGCACCCCTGGGTGGACCTGGAGCACATGCCCAGTGGGGAGAG 
TCTGGGGCGAGCAACCGCCCTGGTGGTGCAGGCTGTGAAGAAAGACCAGGAGGGGGATTC 
AGCAGCCGCCTTATCACTCTACTGCAAGGCTCTGGACTTCTTTGTACCTGCCCTGCACTA 
TGAAGTGGATGCCCAGCGGAAGGAGGCAATTAAGGCAAAGGTGGGGCAGTACGTGTCCCG 
GGCTGAGGAGCTCAAGGCCATCGTCTCCTCTTCCAATCAGGCCCTGCTGAGGCAGGGGAC 
CTCTGCCCGAGACCTGCTCAGAGAGATGGCCCGGGACAAGCCACGCCTCCTAGCTGCCCT 
GGAAGTGGCTTCAGCTGCCATGGCCAAGGAGGAGGCCGCCGGCGGGGAGCAGGATGCCCT 
GGACCTGTACCAGCACAGCCTGGGGGAGCTACTGCTGTTGCTGCGGAGCCCCCGGGCCGG 
AGGCGGGAGCTGCTTCACACTGAGGTTCAGAACCTCATGGCCCGAGCTGAATACTTGAAG 
GAGCAGATGAGGGAATCTCGCTGGGAAGCTGACACCCTGGACAAAGAGGGACTGTCGGAA 
TCTGTTCGTAGCTCTTGCACCCTTCAGTGACCCTAGAAGAATGATTGGACAGATGTGAGC 
CATCTGGAGCAGAGGGGCACTAACCCAGGCTGACGCCAAGAATGAAGTGGCCCACTGCAG 
CCCTGGCGAGCAGGCTTCTTGGATGGACAGTGCTGAGACCCCCATATCCCAGAGTCCCCA 
GCCTCCCTCAGGTTACTCTGCACCCCACAGATGGTTTGATGGCTGTGCTGTATACTGGAG 
GGGAGGGCAGGACTCTGGGAGAACAGCACTTCTTTCATGAGACCTTTGTTACTCGGTGGT 
TACTGGGTCCTGTGCCTGTCCGTTTTGGGGCATGCAGCCCTCTATCATTTTTGGCTCCGA 
GAAGAGGGCAAGGGGCCCCCGCAGGGTACTTCTGTGCTTGCCCTCGCCCTGCCAGCAGGC 
AGCTGTGCCCCTGGCCTGCCCTTCCCGGGACCCCTTATTCCAACTCAGCTCCTCTTTGCA 
CTGGAATGGGGCACTCCAACACCCCTCAGGGACCACCCTCCCCACAGTATGCACTCAGCC 
CCACAGAACCCACCAGTCTTTCTGGGAACTCACACCTGCCCGCCATCTTGGTACTTTAGG 
TTAATCCCTCAAGCATGAAAGCTGGATCTTTTGGGGTTTAAGAAGCCCAAGCCTTGTTCC 
TGCCCTGGCCTAGGGAGCACTCAGGAGGGTTCCTTGGTCCTCATCTCTCCCACCTCCGTT 
CCCTCTGGGCCCCACACTAGCCACAGCGCGGGCCTTGTGCTGGAGTTTGAGCCTGGGACA 
GGGAGAGGGAGGCTTGGAGACAGTCTGACCCAGTGCCCTCTAGGCCACCCACTTCTAGGC 
CTGCCCTGCCGCCGTGGAGCCCTGGGCAAGCTCTTTCCCCTTTCTGGGCCTGGGTCTCCC 
CATCTCTTCAATGGGGCTGATACCTTCACAGCCCACAGCATGGGCACTTATGAGGACAAA 
GTGAATTTAACCTGGAAAAGAATGTATTTGAGAGTTTCTTTTAAATAATCAGCGGGTGTT 
GGTGATTTGTAGCCCTTCTGCCCTTAAATGCTTCCTTGGGCAAGAGCTGTCTGTCCTCCC 
TGCAGGAGGCTGAGTGTGAAGAGTATCATTCATTGTTTCTCTATTAAATTATTTTCTCT 

SEQ ID NO: 101_AA311714_H 

TGGACCTGTCCTGAGGCAGAGGCCGAGATGCGCGCAACCGCGGGAGCAGCCAAGTGGACT 
GGACTCTTTTCTTGACTTAGCTACCAGGAGCTAGAGATGCTGTTATTCTATCGTATGTGA 
GAAGTCGGCCCAGAGATGGAAAACTTTATTCTGTATGAGGAGATCGGAAGAGGAAGCAAG 
ACTGTTGTCTATAAAGGGCGACGGAAGGGAACAATCAATTTTGTAGCCATTCTTTGTACT 
GATAAGTGCAGAAGGCCTGAAATAACCAACTGGGTCCGTCTCACCCGTGAAATAAAACAC 
AAGAATATTGTAACTTTTCATGAATGGTATGAAACAAGCAACCACCTCTGGCTAGTGXAT 
GAAAACCTCCCAGAAGATGTTGTGAGAGAATTTGGAATTGACCTGATTAGTGGATTACAT 
CATCTTCATAAACTTGGCATTCTCTTTTGTGACATTTCTCCTAGGAAGATACTCTTGGAA 
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GGGCCTGGCACACTGAAGTTTAGCAACTTTTGCTTGGCAAAAGTGGAAGGTGAAAATTTG 
GAAGAGTTCTTTGCTTTGGTGGCAGCAGAGGAAGGAGGAGGTGATAATGGGGAAAATGTC 
CTGAAGAAAAGCATGAAAAGTAGAGTCAAAGGATCTCCTGTATATACAGCACCAGAAGTT 
GTGAGGGGTGCTGACTTTTCCATCTCCAGTGACCTCTGGTCTTTGGGCTGTCTGCTTTAT 
GAAATGTTTTCAGGAAAACCTCCATTCTTCTCAGAAAGTGTTTCAGAATTAACTGAAAAG 
ATCTTATGTGAAGATCCTTTGCCACCTATTCCGAAAGATTCTTCTCGTCCTAAAGCTTCT 
TCAGATTTTATTAATTTGCTTGATGGGTTACTTCAAAGAGATCCTCAGAAAAGATTGACT 
TGGACAAGGCTACTGCAGCATTCATTTTGGAAGAAAGCTTTTGCTGGAGCAGATCAGGAA 
TCAAGCGTCGAAGATCTCAGTCTCAGCAGA7VACACTATGGAGTGTTCTGGGCCACAAGAT 
TCCAAGGAGCTTTTGCAGAACTCTCAGAGTAGACAAGCAAAAGGGCACAAGAGTGGTCAA 
CCACTAGGTCACTCTTTCAGACTAGAAAATCCAACTGAGTTTCGGCCTAAGAGTACTCTT 
GAGGGTCAATTGAATGAATCCATGTTTCTTCTCAGTTCTCGTCCTACTCCCAGAACTAGC 
ACTGCAGTGGAAGTAAGTCCTGGTGAGGATATGACTCACTGTTCACCACAGAAGACTTCT 
CCTCTGACCAAGATTACAAGTGGACACCTGAGTCAGCAGGACCTGGAATCCCAGATGAGA 
GAGCTTATCTACACGGACTCAGATCTTGTTGTCACCCCCATTATCGACAATCCAAAGATA 
ATGAAACAGCCACCAGTTAAATTTGATGCAAAAATATTGCATCTACCAACATATTCAGTG 
GATAAGTTATTATTTCTGAAAGATCAAGATTGGAATGACTTTTTGCAACAAGTGTGCTCG 
CAGATCGACTCCACTGAGAAGAGCATGGGGGCCTCCCGAGCCAAGCTGAATCTCCTTTGC 
TATTTGTGCGTGGTGGCTGGTCACCAGGAGGTGGCCACCAGGCTCCTCCATTCCCCCCTG 
TTCCAATTGCTAATCCAGCATTTGCGGATAGCTCCAAACTGGGATATACGGGCCAAGGTT 
GCTCACGTGATTGGTTTACTGGCTTCGCACACAACTGAGCTCCAGGAAAATACACCTGTT 
GTTGAGACTACAAGCTCCATTGGAATCGGGATTTTGAACTGTCTTGTTCAACACTCCACT 
CCAGTGCCTAGACAGTGCCTTGTGTATGTATAGATACTGACAAATATTTCAAAATAAATA 
AAACTGTATCAGCATT 

SEQ ID NO: 1 02_SGK3 84__H 

TCTTTGGCCCACGTGCTGAGGGCGCGGCAGATCCTGACGGAGCCAGAAGTGCGCGACTAC 
CTGCGGGGCCTGGTCAGCGGCCTGCGCTACCTGCACCAGCGGTGCATCCTGCACCGC 

SEQ ID NO: 1 03_AA2 1 04 51_M SGK384_M 

GGTCTGCTGCATGGATAATGGACTGGAACACAGAAAGACCATGCAGGGTTCGGCTGTAGA 
AGGCCAGTATCTCCAGAGGCCAGAAGACACCATCAGATCTCCTGGGACTGGAGTTATAGA 
GGTTGTGAGCTGCCATGTTGAACCAAGCAGGTCACTGAGGGACACAGGCATGTGGATGGA 
AACCCTGCTGGGAGAAAAAAGAAACTGCTGAAGGGACTGACATGGGACAGCAACATGGAA 
CCAGGAATGGTCTCACGCATAGAGAGCTCCCCCGGGGCGTGGGGCTGCTGCTCGCCATGG 
CCCTTATGAACGTGGCGCTCTACCTCTGCCTTGATCAGCTTTTCATCTCCCCTGGACGAT 
CCACCGCGGACTCTAGGCGCTGTCCTCCGGGCTACTTCAGAATGGGGCGGATGAGAAACT 
GCTCACGCTGGCTGTCCTGTGAAGAGCTGAGGACAGAAGTCAGGCAGCTGAAGCGCGTTG 
GGGAGGGAGCCGTGAAGAGAGTCTTTCTGTCTGAATGGAAGGAACACAAAGTCGCTCTCT 
CCCGGCTCACCAGGCTGGAGATGAAGGAGGACTTCCTGCATGGGCTGCAGATGCTGAAGT 
CTCTACAGAGTGAGCACGTGGTCACGCTGGTGGGCTACTGTGAGGAAGATGGCACTATTC 
TCACCGAATATCACCCCTTAGGTTCCTTGAGCAACCTGGAAGAAACACTAAACCTTTCAA 
AGTACCAAGACGTGAACACTTGGCAGCACAGGCTGCAGCTGGCCATGGAGTACGTCAGCA 
TCATTAACTATCTGCATCACAGCCCCCTGGGCACGAGGGTCATGTGTGACTCTAACGACC 
TGCCCAAAACATTGTCCCAGTACCTGCTAACAAGTAACTTCAGCATTGTGGCAAACGACC 
TGGACGCTCTGCCCCTGGTAGACCATGACTCTGGGGTACTTATAAAGTGTGGCCACAGAG 
AGCTCCATGGGGATTTTGTGGCTCCAGAGCAGCTGTGGCCCTACGGAGAAGACACGCCCT 
TCCAAGACGATCTCATGCCTTCCTACAATGAGAAGGTTGACATCTGGAAGATTCCAGATG 
TCTCCAGTTTCCTCTTGGGGCACGTGGAAGGGAGTGATATGGTTAGATTCCATTTGTTTG 
ATATCCATAAGGCGTGCAAGAGCCAGATCCCGGCAGAAAGACCCACTGCTCAGAACGTGC 
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TAGACGCTTACCAGAGGGTTTTCCATTCACTCCGAGACACTGTGATGTCGCAGACGAAAG 
AAATGCTGTAAAAATGAGCCATCGAGTGACGTGCTTGATGGCTGAATGGCATCCCAGCTG 
TTCCGCTCTTGATGATGGAAGAGCTTTGCATGGATGGATGTTGACCCTGGCTGTTCAGCC 
ACGTAGGCCTCCTCTACGTCTGCCTGCATGTTTGAGTGTTCTGCTCTCCTGGCAGCCCGG 
ATGGAAGCTGCCAAGCGAGAAAGCCTGGCTTCAGGATGCTCCCTGGTGAAGATGCAGAGG 
ATTCTGGATCTGCATAGTTTCAAGGGAGTGATCAAACGGTGACCTTGAAGACATGCTGCC 
TGCCTTGGTAACTTTTTATAGACTAGTAGGAAACAGAAATCTTTTGGGGGAGGGGGGGAC 
AACCCACTAGTTCCTCAGAGACAATTTCTTCTCATTCAGAAAGCCCTGTTGGAAGCTGGG 
GATGTTTTAACTCCGTGGCAGGGCACTTGCCTAGTTGTGTGCAAAGCCTTGGATCTGACC 
CATGGCATGTGCACACACACAAATGCTCAAAGAAAATCCCAGACGCCAGAAGTGTGCCCC 
TTTCTTGTCAATAAGGTCATTGTCAGTACCGGAGATGATTTTTTTTATGAAGCGTTTATG 
CTGACTCGTGTCACTGAGCCAAGTGTGCATGGTCGTTAGCTACTTTGTGGGTTCTTCTTT 
CTTTCTACCCTACTTCTTCCCTTTCCACCCCCTAACACTAGATAGGAGAGAGGAGAGAGA 
AAGGAAAGTGGGCACTGTTATATTGTTGGACGACTTCTTGCTGATTAAGGGGTGTCGAGT 
TCCTTGGAGCAATGATCTTTGCTGCCAAGATATCTCATTTCTTCTTGTTTCTTCTTCGCC 
CACGACCACTTCACAAACACCGACCAACAGCAAACAACAACCCACCCCGCTTCTCGGGGG 
CCCTAGCACTTATGTACTTCTGAAAAGTCCCCAGAAATTCCAATCATCACACACTCAGAG 
AAACTGTCTGCTGCTGGCAAAACTACACCCCTGCTAGAGCATGAGGCAAATCATAGTCAG 
CTGCTGTGGACAGTCTGAAGCAGCCTGGCATCCCACACCTGAGATTAAAACAAAAACATT 
CTTACCTGTGTTTTGTTTTTGTTTTAAGAAACCAAAGTGCACCAAGATAGCATGCTCTTG 
AGATTGTGGCTGTCTAGAGATTTTTGGAACAGCAAGTTGAAGGAACTTTCTTACCTGCCT 
TGAATGGTGCTTTGAACTTCCTGCTGACCTGGAGTTTCTGTGTGAATATTTCTATCCAGT 
GTCCCCCTGTACCGGAAAGTACAAAGTCTGCTCTGGGCTTGCATGCCTGAACACTTTAAA 
ACACTGTGGAGCCAGGAATAATGGTACCCACCTGTAATCCCAGCACCTGGGAGACAGGAG 
GAACCAGGAGTTCAGGGTTATCCTGGGCTATATACCGTGACCCTGTCTACCCCCACACCC 
CAATAAAAAAACAAAAAGGTC 

SEQ ID NO: 1 04_SGK071_2_H 

GAGGTGGTGGCTGTGCAGATGATGGTGGAATGCATGGATGACCATTACGCCAGTCAGGCC 
CTGGAGGAGCTGATGCCACTGCTGAAGCTGCGGCACGCCCACATCTCTGTGTACCAGGAG 
CTGTTCATCACGTGGAATGGGGAGATCTCTTCTCTGTACCTCTGCCTGGTGATGGAGTTC 
AATGAGCTCAGCTTCCAGGAGGTCATTGAGGATAAGAGGAAGGCAAAGAAAATCATTGAC 
TCTGAGTGGATGCAGAATGTGCTGGGCCAGGTGCTGGACGCGCTGGAATACCTGCACCAT 
TTGGACATCATCCACAGGAATCTCAAACCCTCCAACATCATCCTCATCAGCAGTGACCAC 
TGCAAACTGCAGGACCTGAGTTCCAATGTGCTAATGACAGACAAAGCCAAATGGAATATT 
CGTGCGGAGGAAGACCCCTTTCGTAAGTCCTGGATGGCCCCTGAAGCCCTCAACTTCTCC 
TTCAGCCAGAAATCAGACATCTGGTCCCTGGGCTGCATCATTCTGGACATGACCAGCTGC 
TCCTTCATGGATGGCACAGAAGCCATGCATCTGCGGAAGTCCCTCCGCCAGAGCCCAGGC 
AGCCTGAAGGCCGTCCTGAAGACAATGGAGGAGAAGCAGATCCCGGATGTGGAAACCTTC 
AGGAATCTTCTGCCCTTGATGCTCCAGATCGACCCCTCGGATCGAATAACGATAAAGGAC 
GTGGTGCACATCACCTTCTTGAGAGGCTCCTTCAAGTCCTCGTGCGTCTCTCTGACCCTG 
CACCGGCAGATGGTGCCTGCGTCCATCACCGACATGCTGTTAGAAGGCAACGTGGCCAGC 
ATTTTAGGTGATGCTGGGGACACAAAGGGGGAGCGTGCCCTGAAGCTCCTGTCCATGGCC 
TTGGCATCCTATTGTTTAGTTCCAGAGGGTTCATTATTTATGCCCCTGGCCTTGCTCCAC 
ATGCACGACCAGTGGCTCAGCTGTGACCAGGACAGAGTCCCTGGGAAGAGAGACTTTGCC 
TCCCTGGGGAAACTAGGGAAGCTGTTGGGCCCCATCCCAAAGGGTCTGCCGTGGCCCCCG 
GAGCTGGTGGAGGTGGTGGTCACGACCATGGAGCTACATGACAGGGTCCTCGATGTCCAG 
CTGTGTGCCTGCTCCCTGCTGCTGCACCTCCTGGGCCAAGCGCTGGTGCACCACCCGGAA 
GCCAAGGCTCCCTGCAACCAAGCCATCACCTCCACCCTGCTGAGTGCTCTTCAGAGCCAC 
CCCGAGGAGGAGCCACTTCTTGTCATGGTCTACAGCCTGCTAGCCATCACCACAACCCAG 
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GAGTCAGAGTCACTGTCAGAGGAGCTGCAGAACGCTGGGCTGCTGGAGCACATCCTGGAG 
CACCTCAACAGCTCCCTCGAAAGCAGGGACGTCTGCGCCAGCGGCCTGGGCCTGCTCTGG 
GCCCTCCTGCTGGACGACCCCATCTTGGCACTCCAGCGCCCCAGGAAAAAGAGAGCTCCA 
AACCACGGAAAGCCCGGGAAACCCAAGAACCCTGCCAGCACCCAAAGTATCATTGTGAAC 
AAGGCCCCCTTGGAGAAGGTCCCGGACCTCATCAGCCAGGTGTTGGCCACCTACCCTGCG 
GATGGGGAAATGGCAGAAGCCAGCTGCGGAGTCTTCTGGCTGCTGTCCCTGCTGGGCTGC 
ATCAAGGAGCAGCAGTTTGAACAAGTGGTGGCGCTGCTCCTGCAAAGCATCCGGCTGTGC 
CAGGACAGAGCCCTGCTGGTGAACAATGCCTACCGGGGACTGGCCAGCCTGGTGAAGGTG 
TCAGAGCTGGCGGCCTTCAAGGTGGTGGTGCAGGAGGAGGGCGGCAGTGGCCTCAGCCTC 
ATCAAGGAGACCTACCAGCTCCACAGGGACGACCCGGAGGTGGTGGAGAACGTGGGCATG 
CTGCTGGTCCACCTGGCTTCCTATGAGGAGATCCTGCCGGAGCTGGTGTCCAGTAGTATG 
AAGGCCCTGCTCCAGGAGATCAAGGAGCGCTTCACCTCCAGCCTGGTGAGTGACAGCAGC 
GCCTTCAGCAAACCAGGCCTCCCTCCAGGTGGAAGCCCCCAGCTGGGGTGCACCACGTCT 
GGGGGACTGGAATAG 

SEQ ID NO: 105_AA1183 52_M SGK0 71_M 

CAGAAGAAGACCCCTGCCAGAAGTCCTGGATGGCTCCTGAAGCTCTCAAATTCTCCTTCT 
CCACCAAATCCGACATCTGGTCTCTGGGCTGCATCATTCTAGACATGGCCACTTGCTCCT 
TCCTGAACGACACAGAAGCCATGCAACTGCGGAAGGCCATCCGCCATCATCCAGGCAGCC 
TGAAGCCCATCCTGAAAACCATGGAGGAGAAGCAAATCCCTGGTACAGATGTCTACTATT 
TGCTTCTGCCCTTCATGTTGCATATCAACCCCTCCGATCGACTGGCAATCAAGGATGTGA 
TGCAAGTCACCTTCATGAGCAACTCCTTCAAAAGCTCCTCTGTTGCGCTGAATATGCAGC 
GGCAGAAGGTCCCCATCTTCATCACTGACGTGCTGCTTGAAGGCAACATGGCCAACATCT 
TAGGCAGCTGGCTGTGTGCTTCCTTTGTGAACGACAGCAGGCACTGTGACTCAGGGATTG 
GCTCGCAGAGACTTGGGTTTGATTTTCAGTCAGTCTCTTGGACAGAGCACCCTCTGAAAG 
ATGTCATGCAGAATTTCTCCAGTCGACCAGAGGTCCAGCTCAGAGCCATTAACAAGTTGT 
TGACAATGCCAGAGGACCAGCTAGGGCTGCCATGGCCCACAGAGCTGCTGGAAGAGGTGA 
TCAGCATCATAAAGCAGCATGGGCGGATCCTGGATATTCTGCTCAGCACCTGCTCCCTTC 
TGCTGCGTGTTCTTGGCCAAGCACTGGCAAAGGACCCAGAAGCTGAGATCCCAAGGAGCA 
GTTTGATCATCTCCTTCCTGATGGATACCTTGCGGAGCCATCCTAACTCTGAAAGGCTTG 
TTAATGTGGTCTACAACGTGCTTGCCATTATTTCCAGCCAAGGACAGATCTCAGAAGAGC 
TGGAAGAGGAGGGGTTGTTTCAGCTTGCCCAAGAGAACCTGGAGCACTTCCAAGAGGACA 
GGGACATCTGCCTCTCTATCCTGAGCCTGCTCTGGTCCCTCCTGGTAGATGTTGTCACTG 
TGGACAAAGAGCCCTTGGAGCAGCTCTCTGGCATGGTCACCTGGGTGCTGGCTACTCATC 
CGGAGGACGTGGAAATAGCAGAGGCTGGCTGTGCGGTGCTCTGGCTGCTGTCCTTGTTGG 
GCTGCATAAAGGAGAGTCAGTTTGAGCAGGTGGTAGTGCTGCTCCTGAGAAGCATCCAGC 
TGTGCCCTGGCAGAGTACTGCTGGTGAACAATGCATTCCGTGGCTTGGCCAGCCTCGCAA 
AGGTGTCCGAACTGGTGGCCTTCCGAATAGTAGTACTGGAAGAGGGCAGCAGCGGCCTCC 
ACCTCATCCAAGATATCTACAAGCTCTACAAGGATGACCCTGAGGTGGTGGAGAACCTCT 
GCATGCTGTTGGCCCATCTGACCTCCTACAAGGAGATCCTGCCAGAGATGGAGTCTGGAG 
GCATCAAAGACCTAGTCCAGGTGATCCGGGGGCGCTTTACCTCCAGCCTGGAGCTGATTT 
CTTACGCTGATGAGATACTCCAGGTACTGGAAGCAAATGCACAACCTGGCCTCCAGGAGG 
ATCAGCTTGAGCCTCCTGCAGGGCAGGAAGCCCCACTGCAGGGAGAGCCCCTCTTCAGGC 
CCTGACATGCTGCCCTTCTGGTCCTGTGGTAAGAGAAAGTATCACTAGGTCCAGTATTAA 
TTTCGTACCCCATGGTGACTAATAAAAGAAGCCCTAGGCTGTTTCTGGC 

SEQ ID NO: 106_018653 . 9_H 

GGCCGGGGTCGGGGCGCGGGGCATGCGCGCGGGCTGGGCAGGGGGCCGGCGGGGCGCAGA 
GCGGAGCCGCCTCGGAGCCTGAGCCGCCCGGGGCCGGGGCCGGGGAGCCGCGCGGGGCCG 
GCCGGCCGGGGGGAGGGGAGCGATGCGGCGCCGGCGGGCGGCAGTGGCCGCGGGTTTCTG 
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CGCCTCCTTCCTGCTGGGCTCCGTCCTCAACGTGCTCTTCGCTCCGGGTCGGAGCCTCCG 

AGGCCAGGCCAGTCCCCTGAGCCTTCGCCGGCCCCGGGTGCGGGCCGTCGCGGGGGCCGC 

GGGGAGCTGGCCCGGCAGATCCGGGCGCGCTACGAGGAGGTGCAGCGCTATTCCCGCGGG 

GGCCCCGGGCCCGGGGCGGGCCGGCCGGAGCGGCGGCGCCTGATGGACCTGGCTCCGGGC 

GGGCCCGGCCTGCCGCGCCCCCGGCCCCCTTGGGCCCGGCCCCTGTCCGACGGCGCCCCA 

GGCTGGCCCCCGGCTCCCGGCCCAGGCTCCCCCGGCCCGGGCCCGCGCCTGGGCTGCGCC 

GCGCTTCGCAACGTGTCCGGCGCGCAGTACATGGGCTCAGGCTACACCAAGGCCGTGTAC 

CGGGTCCGCCTGCCCGGCGGTGCCGCGGTGGCGCTCAAGGCGGTGGACTTTAGCGGCCAC 

GATCTGGGCAGCTGCGTGCGCGAGTTCGGGGTACGGAGGGGCTGCTATCGGCTGGCGGCC 

CACAAGCTGCTTAAGGAGATGGTGCTGCTGGAGCGGCTGCGGCACCCCAACGTGCTGCAG 

CTCTATGGCTACTGCTACCAGGACAGCGAGGACATCCCAGACACCCTGACCACCATCACG 

GAGCTGGGCGCCCCTGTAGAAATGATCCAGCTGCTGCAAACTTCCTGGGAGGATCGATTC 

CGAATCTGCCTGAGCCTGGGCCGCCTCCTCCACCACCTGGCCCACTCCCCACTGGGCTCC 

GTCACTCTGCTGGACTTCCGCCCTCGGCAGTTTGTGCTGGTGGATGGGGAGCTCAAAGTG 

ACGGACCTGGATGACGCACGTGTGGAGGAGACGCCGTGTGCAGGCAGCACCGACTGCATA 

CTCGAGTTTCCGGCCAGGAACTTCACCCTGCCCTGCTCAGCCCAGGGCTGGTGCGAGGGC 

ATGAACGAGAAGCGGAACCTCTATAATGCCTACAGGTTTTTCTTCACATACCTCCTGCCT 

CACAGTGCCCCGCCTTCACTGCGTCCTCTGCTGGACAGCATCGTCAACGCCACAGGAGAG 

CTCGCCTGGGGGGTGGACGAGACCCTGGCCCAGCTGGAGAAGGTGCTGCACCTGTACCGG 

AGCGGGCAGTATCTGCAGAACTCCACGGCAAGCAGCAGTACCGAGTACCAGTGTATCCCA 

GACAGCACCATCCCCCAGGAAGACTACCGCTGCTGGCCATCCTACCACCACGGGAGCTGC 

CTCCTTTCAGTGTTCAACCTGGCTGAGGCTGTGGATGTCTGTGAGAGCCATGCCCAGTGT 

CGGGCCTTTGTGGTCACCAACCAGACCACCTGGACAGGTCGGCAGCTGGTCTTTTTCAAG 

ACTGGATGGAGCCAAGTGGTCCCTGATCCCAACAAGACCACATATGTGAAGGCCTCTGGC 

TGACCTATCTGAGGGCTCGGCTGACCAGCTGACTATCCTCAGCAGCTGGGCTTGCCTGTG 

GAGGGAGTGACTTGCACTGGCAGCACTGCATGTCACCTGGGAACCCCTGCAGACAAAGCT 

AACATCCCAGACAGACAGATGTGACCAGGACAAACGTGCAATAATGCCAAATGTTAAAAT 

GTGAGTTTACCAGCCTAGCTATGGGACTGCTGGCTCCTAGTCCAGGAATCATGGGGGTAT 

GACTGCCTCTCCAACCCTGTGGGCTGTAAGCAAGCTCAGGCTAGTCTCCCCACTGGGGGC 

TGTGCCCCTCCCTGGGACGGTTCCGTGGGCAGCCCCATCACTGTGTTCAATAGTGTGAGA 

ATGTAGCTAAAGCCCCTGCTGCTGCTGCTGCACATGCCACAGCAGGCGGTGGGGGCTGCG 

TGGGGACAATCCATCGTGGAGTGTTCTCTCAGCTTAGGTCTGGACAGGAGACTTGGCGGG 

AGATGCTCCAGGATGTGGGTGATTCTGTACCTGGGGAGGCTATCTCTGACCTCCCGACAG 

GGGACACTCCCAGGCCAGCCCAGGGGTCAGGGGCAGAGGTGCACACCTCAGCATGAGCCA 

AGACTGGGGTCAGGGAGCAGGTGTGGTTTGAGCCAGGACCTGGGGCGGGGGTGGGGCCGG 

GGCCTTTCTGCCTCATTTGCTTTCAATGAAAGCCTCAAAGCAGCCAAAACCAGGCTTTCC 

CCCTTCCTCGAGTTTGAATATCCAGAATCTTTTGTACTTCTTGTTGGTTAAATTGTTTAT 

TTTTGTAAAAAATAAAATAAAATTAGTTAATAAAATGATGTTTCACAGCAAACTCTTCCC 



SEO ID NO: 107 AA396601_M 

CCACGCGTCCGGGCTGCGCCGCGCTCCGCAACGTGTCTGGCGCGCAGTACGTGGGCTCAG 
GCTACACTAAGGCTGTGTACCGGGTCCGCCTGCCCGGCGGCGCCGCGGTGGCGCTTAAAG 
CAGTGGACTTCAGCGGCCACGATCTGGGCAGCTGCGTGCGCGAGTTCGGGGCGCGAAGGG 
GCTGCTATCGCCTGGCGGCCCACAAGCTGCTCAAAGAGATGGTGCTGCTGGAGCGGCTGC 
GGCACCCCAACGTGCTGCAGCTCTATGGCTATTGCTACCAGGACAGTGAGGGCATCCCAG 
ACACGCTGACCACCATCACAGAGCTGGGTGCCCCTGTGGAGATGATCCAGCTGTTGCAGA 
CTTCCTGGGAGGATCGATTCCGAATCTGCCTCAGCCTTGGCCGCCTCCTCCACCACCTGG 
CCCACTCCCCGCTGGGCTCGGTCACCCTGCTTGACTTCCGCCCTCGGCAGTTTGTGCTAG 
TGAACGGGGAGCTGAAAGTGACAGACCTGGATGATGCCCGCGTGGAAGAGACACCGTGCA 
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FIGURE 2DDDD 

CCAGCAGTGCCGACTGCACGCTAGAGTTTCCAGCCAGGAACTTCAGCCTGCCCTGCTCGG 
CCCAGGGCTGGTGCGAGGGCATGAATGAGAAACGGAACCTCTACAATGCCTACAGGTTCT 
TCTTCACATACCTCCTGCCACACAGTGCCCCGCCTTCCCTCCGACCTCTCCTGGATAGCA 
TCGTCAATGCCACGGGAGAGCTCGCCTGGGGGGTGGATGAGACCCTGGCCCAGCTGGAGA 
CAGCGCTACACTTGTTCCGAAGTGGGCAGTACCTGCAGAACTCTACAAGCAGCAGGGCTG 
AGTACCAGCGCATCCCGGACAGTGCCATCACACAGGAGGACTATCGCTGCTGGCCATCCT 
ATCACCACGGCGGCTGCCTCCTGTCCGTGTTCAACCTGGCTGAGGCTATAGATGTCTGTG 
AGAGCCATGCTCAGTGTCGTGCCTTTGTGGTCACCAACCAGACCACCTGGACAGGTCGGA 
AGCTGGTCTTTTTTAAGACTGGATGGAACCAAGTGGTCCCTGATGCCGGCAAGACCACAT 
ATGTGAAGGCCCCTGGTTGACTGGTTGTGGGCTCAGCTGACCAGCTGGGCTTGCCTGCTG 
CAGGCGTGACTTGCATCCCACCTGGGAACCCCTGCAGACAAAAGCTAGCTCCCAGAGCAA 
CTGATGTGACCAGGACAAAACGTGCAATATGCAAAAATGTTAAAATGTGAGTTTGCCAGC 
TTCAGTCCCAGACTGGTTGGAACCCGATTGCCTCTCTGGAGCTGTAGGCTGTGAGCAGGG 
CTCAGGCTGGTCTTAACTGGGACAGTCCCGTGGGCAGCCCATTACTGCATTCATGCTTTG 
AGAATGTAGCCAGAACACTGCTGCTGCATAAGCCACCGTGGGCAGGAGCTGCCTGGGGAC 
AACCAGTCTCAGAGTGCTCTCTCAGCTCAGCTCCGCTCCAAATGGAGAGCGCGGGATGCG 
GAGATGTGAGTGAACCAGCACTGGGAAGAAGGCTCTCGGGCCTCTCCCTAGAGGTTGCTC 
CTAGGCCAGCCCCGAGGCCGTGGGCAGCAGTGCTCGCATCCATATGAGCCAAGACTAGAG 
TGGAGGAGCAGATTGCATTTGAGCCAGGACTGGGGTGGGGGTAGGGTCGGGGCCTCTCTG 
CCTCATTTGCTTTCAGTGAAAGCCAGGGAGCAGCCGCAGCCAGGCTCCTCCCACTCCTGG 
AGGCCAGGCTCCTCCCCCTCCTGGAGGCCAGGCTCCTCCCCCTCCTGGAGTTTGCGTACC 
CAGAAGCTTTTATACTTCTCGTTCATTAAATTGTTTATTTTTGTAAAAAAAAAAATTAAT 

CAATTAATAAAATGATGTTTTGTGAC 
SEQ ID NO: 108_VRK3_H 

ATGATCTCCTTCTGTCCAGACTGTGGCAAAAGTATCCAAGCGGCATTCAAATTCTGCCCC 
TACTGTGGAAATTCTTTGCCTGTAGAGGAGCATGTAGGGTCCCAGACCTTTGTCAATCCA 
CATGTGTCATCCTTCCAAGGCTCAAAGAGAGGGCTGAACTCCAGTTTTGAAACCTCTCCT 
AAGAAAGTGAAATGGTCCAGCACCGTCACCTCTCCCCGATTATCCCTCTTCTCAGATGGT 
GACAGTTCTGAGTCTGAAGATACTCTGAGTTCCTCTGAGAGATCCAAAGGCTCCGGGAGC 
AGACCCCCAACCCCCAAAAGCAGCCCTCAGAAGACCAGGAAGAGCCCTCAGGTGACCAGG 
GGTAGCCCTCAGAAGACCAGCTGTAGCCCTCAGAAGACCAGGCAGAGCCCTCAGACGCTG 
AAGCGGAGCCGAGTGACCACCTCACTTGAAGCTTTGCCCACAGGGACAGTGCTGACAGAC 
AAGAGTGGGCGACAGTGGAAGCTGAAGTCCTTCCAGACCAGGGACAACCAGGGCATTCTC 
TATGAAGCTGCACCCACCTCCACCCTCACCTGTGACTCAGGACCACAGAAGCAAAAGTTC 
TCACTCAAACTGGATGCCAAGGATGGGCGCTTGTTCAATGAGCAGAACTTCTTCCAGCGG 
GCCGCCAAGCCTCTGCAAGTCAACAAGTGGAAGAAGCTGTACTCGACCCCACTGCTGGCC 
ATCCCTACCTGCATGGGTTTCGGTGTTCACCAGGACAAATACAGGTTCTTGGTGTTACCC 
AGCCTGGGGAGGAGCCTTCAGTCGGCCCTGGATGTCAGCCCAAAGCATGTGCTGTCAGAG 
AGGTCTGTGCTGCAGGTGGCCTGCCGGCTGCTGGATGCCCTGGAGTTCCTCCATGAGAAT 
GAGTATGTTCATGGAAATGTGACAGCTGAAAATATCTTTGTGGATCCAGAGGACCAGAGT 
CAGGTGACTTTGGCAGGCTATGGCTTCGCCTTCCGCTATTGCCCAAGTGGCAAACACGTG 
GCCTACGTGGAAGGCAGCAGGAGCCCTCACGAGGGGGACCTTGAGTTCATTAGCATGGAC 
CTGCACAAGGGATGCGGGCCCTCCCGCCGCAGCGACCTCCAGAGCCTGGGCTACTGCATG 
CTGAAGTGGCTCTACGGGTTTCTGCCATGGACAAATTGCCTTCCCAACACTGAGGACATC 
ATGAAGCAAAAACAGAAGTTTGTTGATAAGCCGGGGCCCTTCGTGGGACCCTGCGGTCAC 
TGGATCAGGCCCTCAGAGACCCTGCAGAAGTACCTGAAGGTGGTGATGGCCCTCACGTAT 
GAGGAGAAGCCGCCCTACGCCATGCTGAGGAACAACCTAGAAGCTTTGCTGCAGGATCTG 
CGTGTGTCTCCATATGACCCCATTGGCCTCCCGATGGTGCCCTAG 
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FIGURE 2EEEE 
SEQ ID NO: 109_S71575_M VRK3JVI 

CCATCCCCACCTGTATCGGCTTTGGCATTCACCAGGACAAGTACAGGTTCCTAGTATTCC 
CCAGCCTGGGGAGGAGCCTTCAGTCAGCCCTGGATGACAACCCAAAGCATGTGGTATCAG 
AGAGATGTGTGCTTCAGGTGGCCTGCAGGCTGCTGGATGCTCTGGAGTATCTCCATGAAA 
ATGAGTATGTTCACGGGAACCTGACAGCTGAGAATGTCTTTGTGAATCCAGAGGATCTGA 
GCCAGGTGACCCTGGTGGGCTATGGCTTCACCTACCGATACTGCCCAGGTGGCAAACACG 
TGGCCTACAAAGAAGGCAGCAGGAGTCCACACGATGGGGACTTGGAGTTCATTAGCATGG 
ACCTGCACAAGGGATGCGGACCCTCCCGCCGCAGCGATCTCCAGACCTTGGGCTACTGTA 
TGCTCAAGTGGCTTTATGGGTCCCTGCCATGGACAAATTGCCTTCCCAACACCGAAAAGA 
TAACTAGGCAGAAGCAGAAGTATCTGGACAGCCCCGAGCGCCTCGTGGGACTGTGTGGCC 
GCTGGAACAAGGCCTCAGAGACCCTGCGGGAGTACCTGAAGGTGGTGATGGCCCTCAATT 
ATGAGGAGAAGCCACCCTATGCCACGCTGAGGAACAGCCTAGAAGCTCTGCTGCAGGATA 
TGCGGGTGTCACCCTATGACCCTCTGGACCTCCAGATGGTGCCTTAGATGGAATCCAGAG 
CTTCCGACTTGCAGCTTGAAGTAGAACATGAAGTAGTGTGACTGGAGGCCTGTTTGAACT 
CATAGCTCCTAAAAG7^ATCCCTTGAATGTGCATTCTCACCGCTCCCTTAGGACATATG7^A 
TCAGCACTTGTGTTGGGGAACCTGAGTCATGTCATGTAATGTGAAACTCCTCCCTGTCTC 
AGCTCTGGCAGCTGTGGATGGAGGTAAGTGGATGCTGGCGGCGGCGGCGGCAGCAGCCAC 
TCCACTCCCTATGGCATTTCTGTGATGGCATAATAAACTGTTTTTAATC 

SEQ ID NO: 1 1 0_AA4 5427_H 

ATGGGCCACGCGCTGTGTGTCTGCTCTCGGGGAACTGTCATCATTGACAATAAGCGCTAC 
CTCTTCATCCAGAAACTGGGGGAGGGTGGGTTCAGCTATGTGGACCTAGTGGAAGGGTTA 
CATGATGGACACTTCTACGCCCTGAAGCGAATCCTGTGTCACGAGCAGCAGGACCGGGAG 
GAGGCCCAGCGAGAAGCCGACATGCATCGCCTCTTCAATCACCCCAACATCCTTCGCCTC 
GTGGCTTACTGTCTGAGGGAACGGGGTGCTAAGCATGAGGCCTGGCTGCTGCTACCATTC 
TTCAAGAGAGGTACGCTGTGGAATGAGATAGAAAGGGTGAAGGACAAAGGCAACTTCCTG 
ACCGAGGATCAAATCCTTTGGCTGCTGCTGGGGATCTGCAGAGGCCTTGAGGCCATTCAT 
GCCAAGGGTTATGCCCACAGAGACTTGAAGCCCACCAATATATTGCTTGGAGATGAGGGG 
CAGCCAGTTTTAATGGACTTGGGTTCCATGAATCAAGCATGCATCCATGTGGAGGGCTCC 
CGCCAGGCTCTGACCGTGCAGGACTGGGCAGCCCAGCGGTGCACCATCTCCTACCGAGCC 
CCAGAGCTCTTCTCTGTGCAGAGTCACTGTGTCATCGATGAGCGGACTGATGTCTGGTCC 
CTAGGCTGCGTGCTATATGCCATGATGTTTGGGGAAGGCCCTTATGACATGGTGTTCCAA 
AAGGGTGACAGTGTGGCCCTTGCTGTGCAGAACCAACTCAGCATCCCACAAAGCCCCAGG 
CATTCTTCAGCATTGCGGCAGCTCCTGAACTCGATGATGAGCGTGGACCCGCATCAGCGT 
CCTCACATTCCTCTCCTCCTCAGTCAGCTGGAGGCGCTGCAGCCCCCAGCTCCTGGCCAA 
CATACTACCCAAATCTGA 

SEQ ID NO: 111_H05721_H 

CCCTGAGGCACCGCCCCAAGTTTGGTGTGACCGGCGGGGGACGCCGGTGGTGGCGGCAGC 
GACGGCTGCGGGGGCACCGGGCCGCGGCGCCACCATGGCGGTGCGACAGGCGCTGGGCCG 
CGGCCTGCAGCTGGGTCGAGCGCTGCTGCTGCGCTTCACGGGCAAGCCCGGCCGGGCCTA 
CGGCTTGGGGCGGCCGGGCCCGGCGGCGGGCTGTGTCCGCGGGGAGCGTCCAGGCTGGGC 
CGCAGGACCGGGCGCGGAGCCTCGCAGGGTCGGGCTCGGGCTCCCTAACCGTCTCCGCTT 
CTTCCGCCAGTCGGTGGCCGGGCTGGCGGCGCGGTTGCAGCGGCAGTTCGTGGTGCGGGC 
CTGGGGCTGCGCGGGCCCTTGCGGCCGGGCAGTCTTTCTGGCCTTCGGGCTAGGGCTGGG 
CCTCATCGAGGAAAAACAGGCGGAGAGCCGGCGGGCGGTCTCGGCCTGTCAGGAGATCCA 
GGCAATTTTTACCCAGAAAAGCAAGCCGGGGCCTGACCCGTTGGACACGAGACGCTTGCA 
GGGCTTTCGGCTGGAGGAGTATCTGATAGGGCAGTCCATTGGTAAGGGCTGCAGTGCTGC 
TGTGTATGAAGCCACCATGCCTACATTGCCCCAGAACCTGGAGGTGACAAAGAGCACCGG 
GTTGCTTCCAGGGAGAGGCCCAGGTACCAGTGCACCAGGAGAAGGGCAGGAGCGAGCTCC 
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FIGURE 2FFFF 

GGGGGCCCCTGCCTTCCCCTTGGCCATCAAGATGATGTGGAACATCTCGGCAGGTTCCTC 
CAGCGAAGCCATCTTGAACACAATGAGCCAGGAGCTGGTCCCAGCGAGCCGAGTGGCCTT 
GGCTGGGGAGTATGGAGCAGTCACTTACAGAAAATCCAAGAGAGGTCCCAAGCAACTAGC 
CCCTCACCCCAACATCATCCGGGTTCTCCGCGCCTTCACCTCTTCCGTGCCGCTGCTGCC 
AGGGGCCCTGGTCGACTACCCTGATGTGCTGCCCTCACGCCTCCACCCTGAAGGCCTGGG 
CCATGGCCGGACGCTGTTCCTCGTTATGAAGAACTATCCCTGTACCCTGCGCCAGTACCT 
TTGTGTGAACACACCCAGCCCCCGCCTCGCCGCCATGATGCTGCTGCAGCTGCTGGAAGG 
CGTGGACCATCTGGTTCAACAGGGCATCGCGCACAGAGACCTGAAATCCGACAACATCCT 
TGTGGAGCTGGACCCAGACGGCTGCCCCTGGCTGGTGATCGCAGATTTTGGCTGCTGCCT 
GGCTGATGAGAGCATCGGCCTGCAGTTGCCCTTCAGCAGCTGGTACGTGGATCGGGGCGG 
AAACGGCTGTCTGATGGCCCCAGAGGTGTCCACGGCCCGTCCTGGCCCCAGGGCAGTGAT 
TGACTACAGCAAGGCTGATGCCTGGGCAGTGGGAGCCATCGCCTATGAAATCTTCGGGCT 
TGTCAATCCCTTCTACGGCCAGGGCAAGGCCCACCTTGAAAGCCGCAGCTACCAAGAGGC 
TCAGCTACCTGCACTGCCCGAGTCAGTGCCTCCAGACGTGAGACAGTTGGTGAGGGCACT 
GCTCCAGCGAGAGGCCAGCAAGAGACCATCTGCCCGAGTAGCCGCAAATGTGCTTCATCT 
AAGCCTCTGGGGTGAACATATTCTAGCCCTGAAGAATCTGAAGTTAGACAAGATGGTTGG 
CTGGCTCCTCCAACAATCGGCCGCCACTTTGTTGGCCAACAGGCTCACAGAGT^AGTGTTG 
TGTGGAAACAAAAATGAAGATGCTCTTTCTGGCTAACCTGGAGTGTGAAACGCTCTGCCA 
GGCAGCCCTCCTCCTCTGCTCATGGAGGGCAGCCCTGTGATGTCCCTGCATGGAGCTGGT 
GAATTACTAAAAGAACATGGCATCCTCTGTGTCGTGATGGTCTGTGAATGGTGAGGGTGG 
GAGTCAGGAGACAAGACAGCGCAGAGAGGGCTGGTTAGCCGGAAAAGGCCTCGGGCTTGG 
CAAATGGAAGAACTTGAGTGAGAGTTCAGTCTGCAGTCCTCTGCTCACAGACATCTGAAA 
AGTGAATGGCCAAGCTGGTCTAGTAGATGAGGCTGGACTGAGGAGGGGTAGGCCTGCATC 
CACAGAGAGGATCCAGGCCAAGGCACTGGCTGT.CAGTGGCAGAGTTTGGCTGTGACCTTT 
GCCCCTAACACGAGGAACTCGTTTGAAGGGGGCAGCGTAGCATGTCTGATTTGCCACCTG 
GATGAAGGCAGACATCAACATGGGTCAGCACGTTCAGTTACGGGAGTGGGAAATTACATG 
AGGCCTGGGCCTCTGCGTTCCCAAGCTGTGCGTTCTGGACCAGCTACTGAATTATTAATC 
TCACTTAGCGAAAGTGACGGATGAGCAGTAAGTAAGTAAGTGTGGGGATTTAAACTTGAG 
GGTTTCCCTCCTGACTAGCCTCTCTTACAGGAATTGTGAAATATTAAATGCAAATTTACA 
ACTGCAGATGACGTATGTGCCTTGAACTGAATATTTGGCTTTAAGAATGATTCTTCTTAT 
ACTCTGAAGGTGAGAATATTTTGTGGGCAGGTATCAACATTGGGGAAGAGATTTCATGTC 
TAACTAACTAACTTTATACATGATTTTTAGGAAGCTATTGCCTAAATCAGCGTCAACATG 
CAGTAAAGGTTGTCTTCAACTGACAAAA 

SEQ ID NO: 1 12_AI 0868 65_H 

AATGAGATGGAGAAGTACGAGCGGATCCGAGTGGTGGGGAGAGGTGCCTTCGGGATTGTG 
CACCTGTGCCTGCGAAAGGCTGACCAGAAGCTGGTGATCATCAAGCAGATTCCAGTGGAA 
CAGATGACCAAGGAAGAGCGGCAGGCAGCCCAGAATGAGTGCCAGGTCCTCAAGCTGCTC 
AACCACCCCAATGTCATTGAGTACTACGAGAACTTCCTGGAAGACAAAGCCCTTATGATC 
GCCATGGAATATGCACCAGGCGGCACTCTGGCTGAGTTCATCCAAAAGCGCTGTAATTCC 
CTGCTGGAGGAGGAGACCATCCTGCACTTCTTCGTGCAGATCCTGCTTGCACTGCATCAT 
GTGCACACCCACCTCATCCTGCACCGAGACCTCAAGACCCAGAACATCCTGCTTGACAAA 
CACCGCATGGTCGTCAAGATCGGTGATTTCGGCATCTCCAAGATCCTTAGCAGCAAGAGC 
ACCCCATGCTATATCTCCCCTGAGCTGTGTGAGGGCAAGCCCTACAACCAGAAGAGTGAC 
ATCTGGGCCCTGGGCTGTGTCCTCTACGAGCTGGCCAGCCTCAAGAGGGCTTTCGAGGCT 
GCGAACTTGCCAGCACTGGTGCTGAAGATCATGAGTGGCACCTTTGCACCTATCTCTGAC 
CGGTACAGCCCTGAGCTTCGCCAGCTGGTCCTGAGTCTACTCAGCCTGGAGCCTGCCCAG 
CGGCCACCACTCAGCCACATCATGGCACAGCCCCTCTGCATCCGTGCCCTCCTCAACCTC 
CACACCGACGGCAGAGAAGTCCGTGGCCCCCAGCAACACAGGGAGCAGGACCACCAGTGT 
CCGCTGCAGAGAGGCATCATCATGACATTCGGCAGCGGCAGCAATGGGTGCCTAGGCCAT 
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FIGURE 2GGGG 

GGCAGCCTCACTGACATCAGCCAGCCCACCATTGTGGAGGCTTTGTTGGGCTATGAAATG 

GTGCAGCAAGTGGAGGAGGCCCTGAGCTTCACACTACTAGGCTCTGCACCCCTGGACCAG 

GAGCCTCTGCTGAGTATAGACCTGGGCACTGCTCACTCAGCTGCTGTGACTGGTGAGGAG 

GACTTGGGCTCTGGAGATGTAAACAGGTTACCCAGCTGGGAGAGAGGACATCTGCTGGCT 

GGTGTGGCGTCCAGCACTGATGTGTCTACCTTCTCTGAAGGTGACTGCAAGGAGCCTGAC 

AAGTGCTGCTGGAGACACAAGCAGTGCACTGGGCACATCATCTACCCTTTCGCCTCTGAC 

TGTGTCCGCCACAGCCTGCACCTACACTCTGTCAACCACTGCAACTGTAATTCTAGGCTG 

AAGGACTCTTCAGAGGATAGCAGCAGCTCCCGGGGCGCGGGCCCAACCTGCTCCCATGTC 

ATCGAGTCCCCTTGCTTTGAGCTCACACCGGAGGAGGAGCATGTGGAGCGATTCCGGTAT 

GGCTGGTGCAAAAGCTACAGACCTGTCTCTGTGGCAGTGATCCACCATCCACTCTACCAT 

GAGTGTGGGGCAGATGATCTAAATGXXAAGAAGAGGAAGAGGAGGAGGAGGAAAAGCAAG 

CCCCCCATCCCGACACAGGTGGGGCCCGCCACCGCCTCCCCTGACCTAGGCACCAGCATG 

GCCACTGGTACCCCTGACTCCACAGCGCCCATCACCATCTGGCGCTCTGAGAGCCCCACA 

GGGAAGGGTCAGGGCAGCAAGGTGATCAAGAAGGTAAAGAAGAAAAAGGAAAAAGAGAAA 

GACAAGGAGGAGATGGATGAGAAGGCAAAGCTGAAGAAAAAAGCCAAGAAAGGCCAGTTG 

ACTAAGAAGAAAAGCCCGGTTAAATTGGAGCCTTCCCCGCCAGACGTGAGCCGATCATTA 

AGCGCAAGACAGCTGGCCAGGATGTCCGAGTCCAGCCCAGAAAGCCGGGAAGAGCTGGAG 

AGCGAGGACAGTTACAATGGCCGGGGGCAGGGAGAACTGTCCAGCGAGGATATTGTGGAA 

TCATCATCGCCCAGGAAGAGAGAGAACACAGTCCAGGCCAAAAAGACAGGGGCAAAGCCC 

TCACAAGCCAGGAAGGTAAACAAGAGAAAATCTCCCCCAGGATCAAACCCCAACCTCAGT 

TGAGGCCAGGGTGGTCAGGGTGCAGAATAAATGCCATCGAGCCTGTGGCTGGCCCTCTGC 

TGCTGTTCTCTCCCTCCAACCTGGCTGTTTCTTGCGGGGCAAGGGGTGGGCTCAGGGCTG 

CAGGGGTTTCTCAAAGGCAATCCAGCTTTCACAAAGGAAGCCCATGGGAAGGCAGGTGGG 

AGGGAAAGGAAGGGGCACAGCCCTATTTCTTCCTACCTGCTAGGACAAGGTGGAAGAGTG 

TATCTGGGGTGGGAAGGAGGGCTTCCCCTCTCTGCTGCGAGAGACTGGTCTGTGTGAAAT 

CCACTTCTGGGACAGGCAGTACTGTCTGCAGCGATACCCCCAATAAACGGAACTTTTTAA 

ccc 

SEO ID NO: 113 AA836348 H 

ATGTCGGTGCTGGGCGAGTACGAGCGACACTGCGATTCCATCAACTCGGACTTTGGGAGC 

GAGTCCGGGGGTTGCGGGGACTCGAGTCCGGGGCCTAGCGCCAGTCAGGGGCCGCGAGCC 

GGCGGCGGCGCGGCGGAGCAGGAGGAACTGCACTACATCCCCATCCGCGTCCTGGGCCGC 

GGCGCCTTCGGGGAAGCCACGCTGTACCGCCGCACCGAGGATGACTCACTGGTTGTGTGG 

AAGGAAGTCGATTTGACCCGGCTGTCTGAGAAGGAACGTCGTGATGCCTTGAATGAGATA 

GTTATTCTGGCACTGCTGCAGCACGACAACATTATTGCCTACTACAATCACTTCATGGAC 

AATACCACGCTGCTGATTGAGCTGGAATATTGTAATGGAGGGAACCTGTATGACAAAATC 

CTTCGTCAGAAGGACAAGTTGTTTGAGGAAGAGATGGTGGTGTGGTACCTATTTCAGATT 

GTTTCAGCAGTGAGCTGCATCCATAAAGCTGGAATCCTTCATAGAGATATAAAGACATTA 

AATATTTTTCTGACCAAGGCAAACCTGATAAAACTTGGAGATTATGGCCTAGCAAAGAAA 

CTTAATTCTGAGTATTCCATGGCTGAGACGCTTGTGGGAACCCCATATTACATGTCTCCA 

GAGCTCTGTCAAGGAGTAAAGTACAATTTCAAGTCTGATATCTGGGCAGTTGGCTGCGTC 

ATTTTTGAACTGCTTACCTTAAAGAGGACGTTTGATGCTACAAACCCACTTAACCTGTGT 

GTGAAGATCGTGCAAGGAATTCGGGCCATGGAAGTTGACTCTAGCCAGTACTCTTTGGAA 

TTGATCCAAATGGTTCATTCGTGCCTTGACCAGGATCCTGAGCAGAGACCTACTGCAGAT 

GAACTTCTAGATCGCCCTCTTCTCAGGAAACGCAGGAGGTCAAGCACTGTGACTGAAGCA 

CCCATTGCTGTAGTAACATCACGAACCAGTGAAGTCTATGTTTGGGGTGGTGGAAAATCC 

ACCCCCCAGAAACTGGATGTTATCAAGAGTGGCTGTAGTGCCCGGCAGGTCTGTGCAGGG 

AATACCCACTTTGCTGTGGTCACAGTGGAGAAGGAACTGTACACTTGGGTGAACATGCAA 

OTAGGCACTAAACTCCATGGTCAGCTGGGCCATGGAGACAAAGCCTCCTATCGACAGCCA 

AAGCATGTGGAAAAGTTGCAAGGCAAAGCTATCCGTCAGGTGTCATGTGGTGATGATTTC 
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FIGURE 2HHHH 

ACTGTCTGTGTGACTGATGAGGGTCAGCTCTATGCGTTCGGATCAGATTATTATGGCTGC 
ATGGGGGTGGACAAAGTTGCTGGCCCTGAAGTGCTAGAACCCATGCAGCTGAACTTCTTC 
CTCAGCAATCCAGTGGAGCAGGTCTCCTGTGGAGATAATCATGTGGTGGTTCTGACACGA 
AACAAGGAAGTCTATTCTTGGGGCTGTGGCGAATATGGACGACTGGGTTTGGATTCAGAA 
GAGGATTATTATACACCACAAAAGGTGGATGTTCCCAAGGCCTTGATTATTGTTGCAGTT 
CAATGTGGCTGTGATGGGACATTTCTGTTGACCCAGTCAGGCAAAGTGCTGGCCTGTGGA 
CTCAATGAATTCAATAAGCTGGGTCTGAATCAGTGCATGTCGGGAATTATCAACCATGAA 
GCATACCATGAAGTTCCCTACACAACGTCCTTTACCTTGGCCAAACAGTTGTCCTTTTAT 
AAGATCCGTACCATTGCCCCAGGCAAGACTCACACAGCTGCTATTGATGAGCGAGGCCGG 
CTGCTGACCTTTGGCTGCAACAAGTGTGGGCAGCTGGGCGTTGGGAACTACAAGAAGCGT 
CTGGGAATCAACCTGTTGGGGGGACCCCTTGGTGGGAAGCAAGTGATCAGGGTCTCCTGC 
GGTGATGAGTTTACCATTGCTGCCACTGATGAGAAAGTATTGAATTCTAAGACCATCCGT 
TCCAATAGCAGTGGCTTATCCATTGGAACTGTGTTTCAGAGCTCTAGCCCGGGAGGAGGC 
GGCGGGGGCGGCGGTGGTGAAGAAGAGGACAGTCAGCAGGAATCTGAAACTCCTGACCCA 
AGTGGAGGCTTCCGAGGAACAATGGAAGCAGACCGAGGAATGGAAGGTTTAATCAGTCCC 
ACAGAGGCCATGGGGAACAGTAATGGGGCCAGCAGCTCCTGTCCTGGCTGGCTTCGAAAG 
GAGCTGGAAAATGCAGAATTTATCCCCATGCCTGACAGCCCATCTCCTCTCAGTGCAGCG 
TTTTCAGAATCTGAGAAAGATACCCTGCCCTATGAAGAGCTGCAAGGACTCAAAGTGGCC 
TCTGAAGCTCCTTTGGAACACAAACCCCAAGTAGAAGCCTCGGTAACTGAGCTTTTTGCC 
TTTGAATCACAACTAGTCACCTCGGCTGAATCCTGCAGTAACCTGTGCTGGGAAGGGAAC 
ACCACTGACTCCTCCTGCGTGTGCGTGCAGCTCTCTGCAGGTGGAGGTTGA 

SEQ ID NO: 114_R86668_H , MKK6__H 

ATGAACTTGCTGCTCTCCTACCGCGATGTGCAGGACTACTCGGCCATCATTGAGCTGGTG 
GAGACGCTGCAGGCCTTGCCCACCTGTGATGTGGCCGAGCAGCATAATGTCTGCTTCCAC 
TACACTTTTGCCCTCAACCGGAGGAACAGGCCTGGGGACCGGGCGAAGGCCCTGTCTGTG 
CTGCTGCCGCTGGTACAGCTTGAGGGCTCTGTGGCGCCCGATCTGTACTGCATGTGTGGC 
CGTATCTACAAGGACATGTTCTTCAGCTCGGGTTTCCAGGATGCTGGGCACCGGGAGCAG 
GCCTATCACTGGTATCGCAAGGCTTTTGACGTAGAGCCCAGCCTTCACTCAGGCATCAAT 
GCAGCTGTGCTCCTCATTGCTGCCGGGCAGCACTTTGAGGATTCCAAAGAGCTCCGGCTA 
ATAGGCATGAAGCTGGGCTGCCTGCTGGCCCGCAAAGGCTGCGTGGAGAAGATGCAGTAT 
TACTGGGATGTGGGTTTCTACCTGGGAGCCCAGATCCTCGCCAATGACCCCACCCAGGTG 
GTGCTGGCTGCAGAGCAGCTGTATAAGCTCAATGCCCCCATATGGTACCTGGTGTCCGTG 
ATGGAGACCTTCCTGCTCTACCAGCACTTCAGGCCCACGCCAGAGCCCCCTGGAGGGCCA 
CCACGCCGTGCCCACTTCTGGCTCCACTTCTTGCTACAGTCCTGCCAACCATTCAAGACA 
GCCTGTGCCCAGGGCGACCAGTGCTTGGTGCTGGTCCTGGAGATGAACAAGGTGCTGCTG 
CCTGCAAAGCTCGAGGTTCGGGGTACTGACCCAGTAAGCACAGTGACCCTGAGCCTGCTG 
GAGCCTGAGACCCAGGACATTCCCTCCAGCTGGACCTTCCCAGTCGCCTCCATATGCGGA 
GTCAGCGCCTCAAAGCGCGACGAGCGCTGCTGCTTCCTCTATGCACTCCCCCCGGCTCAG 
GACGTCCAGCTGTGCTTCCCCAGCGTAGGGCACTGCCAGTGGTTCTGCGGCCTGATCCAG 
GCCTGGGTGACGAACCCGGATTCCACGGCGCCCGCGGAGGAGGCGGAGGGCGCGGGGGAG 
ATGTTGGAGTTTGATTATGAGTACACGGAGACGGGCGAGCGGCTGGTGCTGGGCAAGGGC 
ACGTATGGGGTGGTGTACGCGGGCCGCGATCGCCACACGAGGGTGCGCATCGCCATCAAG 
GAGATCCCGGAGCGGGACAGCAGGTTCTCTCAGCCCCTGCATGAAGAGATCGCTCTTCAC 
AGACGCCTGCGCCAGAAGAACATAGTGCGCTATCTGGGCTCAGCTAGCCAGGGCGGCTAC 
CTTAAGATCTTCATGGAGGAAGTGCCTGGAGGCAGCCTGTCCTCCTTGCTGCGGTCGGTG 
TGGGGACCCCTGAAGGACAACGAGAGCACCATCAGTTTCTACACCCGCCAGATCCTGCAG 
GGACTTGGCTACTTGCACGACAACCACATCGTGCACAGGGACATAAAAGGGGACAATGTG 
CTGATCAACACCTTCAGTGGGCTGCTCAAGATTTCTGACTTCGGCACCTCCAAGCGGCTG 
GCAGGCATCACACCTTGCACTGAGACCTTCACAGGAACTCTGCAGTATATGGCCCCAGAA 
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FIGURE 2III1 

ATCATTGACCAGGGCCCACGCGGGTATGGGAAAGCAGCTGACATCTGGTCACTGGGCTGC 
ACTGTCATTGAGATGGCCACAGGTCGCCCCCCCTTCCACGAGCTCGGGAGCCCACAGGCT 
GCCATGTTTCAGGTGGGTATGTACAAGGTCCATCCGCCAATGCCCAGCTCTCTGTCGGCC 
GAGGCCCAAGCCTTTCTCCTCCGAACTTTTGAGCCAGACCCCCGCCTCCGAGCCAGCGCC 
CAGACACTGCTGGGGGACCCCTTCCTGCAGCCTGGGAAAAGGAGCCGCAGCCCCAGCTCC 
CCACGACATGCTCCACGGCCCTCAGATGCCCCTTCTGCCAGTCCCACTCCTTCAGCCAAC 
TCAACCACCCAGTCTCAGACATTCCCGTGCCCTCAGGCACCCTCTCAGCACCCACCCAGC 
CCCCCGAAGCGCTGCCTCAGTTATGGGGGCACCAGCCAGCTCCGGGTGCCCGAGGAGCCT 
GCGGCCGAGGAGCCTGCGTCTCCGGAGGAGAGTTCGGGGCTGAGCCTGCTGCACCAGGAG 
AGCAAGCGTCGGGCCATGCTGGCCGCAGTATTGGAGCAGGAGCTGCCAGCGCTGGCGGAG 
AATCTGCACCAGGAGCAGAAGCAAGAGCAGGGGGCCCGTCTGGGCAGAAACCATGTGGAA 
GAGCTGCTGCGCTGCCTCGGGGCACACATCCACACTCCCAACCGCCGGCAGCTCGCCCAG 
GAGCTGCGGGCGCTGCAAGGACGGCTGAGGGCCCAGGGCCTTGGGCCTGCGCTTCTGCAC 
AGACCGCTGTTTGCCTTCCCGGATGCGGTGAAGCAGATCCTCCGCAAGCGCCAGATCCGT 
CCACACTGGATGTTCGTTCTGGACTCACTGCTCAGCCGTGCTGTGCGGGCAGCCCTGGGT 
GTGCTAGGACCGGAGGTGGAGAAGGAGGCGGTCTCACCGAGGTCAGAGGAGCTGAGTAAT 
GAAGGGGACTCCCAGCAGAGCCCAGGCCAGCAGAGCCCGCTTCCGGTGGAGCCCGAGCAG 
GGCCCCGCTCCTCTGATGGTGCAGCTGAGCCTCTTGAGGGCAGAGACTGATCGGCTGCGC 
GAAATCCTGGCGGGGAAGGAACGGGAGTACCAGGCCCTGGTGCAGCGGGCTCTACAGCGG 
CTGAATGAGGAAGCCCGGACCTATGTCCTGGCCCCAGAGCCTCCAACTGCTCTTTCAACG 
GACCAGGGCCTGGTGCAGTGGCTACAGGAACTGAATGTGGATTCAGGCACCATCCAAATG 
CTGTTGAACCATAGCTTCACCCTCCACACTCTGCTCACCTATGCCACTCGAGATGACCTC 
ATCTACACCCGCATCAGGGGAGGGATGGTATGCCGCATCTGGAGGGCCATCTTGGCACAG 
CGAGCAGGATCCACACCAGTCACCTCTGGACCCTGA 

SEQ ID NO: 115_PAK6_H 

ATGTTTGGGAAGAAAAAGAAAAAGATTGAAATATCTGGCCCGTCCAACTTTGAACACAGG 
GTTCATACTGGGTTTGATCCACAAGAGCAGAAGTTTACCGGCCTTCCCCAGCAGTGGCAC 
AGCCTGTTAGCAGATACGGCCAACAGGCCAAAGCCTATGGTGGACCCTTCATGCATCACA 
CCCATCCAGCTGGCTCCTATGAAGACAATCGTTAGAGGAAACAAACCCTGCAAGGAAACC 
TCCATCAACGGCCTGCTAGAGGATTTTGACAACATCTCGGTGACTCGCTCCAACTCCCTA 
AGGAAAGAAAGCCCACCCACCCCAGATCAGGGAGCCTCCAGCCACGGTCCAGGCCACGCG 
GAAGAAAATGGCTTCATCACCTTCTCCCAGTATTCCAGCGAATCCGATACTACTGCTGAC 
TACACGACCGAAAAGTACAGGGAGAAGAGTCTCTATGGAGATGATCTGGATCCGTATTAT 
AGAGGCAGCCACGCAGCCAAGCAAAATGGGCACGTAATGAAAATGAAGCACGGGGAGGCC 
TACTATTCTGAGGTGAAGCCTTTGAAATCCGATTTTGCCAGATTTTCTGCCGATTATCAC 
TCACATTTGGACTCACTGAGCAAACCAAGTGAATACAGTGACCTCAAGTGGGAGTATCAG 
AGAGCCTCGAGTAGCTCCCCTCTGGATTATTCATTCCAATTCACACCTTCTAGAACTGCA 
GGGACCAGCGGGTGCTCCAAGGAGAGCCTGGCGTACAGTGAAAGTGAATGGGGACCCAGC 
CTGGATGACTATGACAGGAGGCCAAAGTCTTCGTACCTGAATCAGACAAGCCCTCAGCCC 
ACCATGCGGCAGAGGTCCAGGTCAGGCTCGGGACTCCAGGAACCGATGATGCCATTTGGA 
GCAAGTGCATTTAAAACCCATCCCCAAGGACACTCCTACAACTCCTACACCTACCCTCGC 
TTGTCCGAGCCCACAATGTGCATTCCAAAGGTGGATTACGATCGAGCACAGATGGTCCTC 
AGCCCTCCACTGTCAGGGTCTGACACCTACCCCAGGGGCCCTGCCAAACTACCTCAAAGT 
CAAAGCAAATCGGGCTATTCCTCAAGCAGTCACCAGTACCCGTCTGGGTACCACAAAGCC 
ACCTTGTACCATCACCCCTCCCTGCAGAGCAGTTCGCAGTACATCTCCACGGCTTCCTAC 
CTGAGCTCCCTCAGCCTCTCATCCAGCACCTACCCGCCGCCCAGCTGGGGCTCCTCCTCC 
GACCAGCAGCCCTCCAGGGTGTCCCATGAACAGTTTCGGGCGGCCCTGCAGCTGGTGGTC 
AGCCCAGGAGACCCCAGGGAATACTTGGCCAACTTTATCAAAATCGGGGAAGGCTCAACC 
GGCATCGTATGCATCGCCACCGAGAAACACACAGGGAAACAAGTTGCAGTGAAGAAAATG 
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GACCTCCGGAAGCAACAGAGACGAGAACTGCTTTTCM 
Sc^CCATGACAATGTGGTTGACATGTACAGCAGCTA^ 

CTGGTCATGGAGTTTCTAGAAGGTGGTG^ 

AATGAAGAACAGATAGCTACTGTCTGCOT^^ 
AACCAAGGAGTGATTCACAGGGACATAAAAAGTGA^TCCATC^ 



AAATCATTGGTTGGCACTCCCTACTGGATGGCCC^CTG^ 



CCCCCCTACTTCAATGAGCCTCCCCT^^ CCGGGGAT TCCTAGACTTGATG 
CCAAGAGTGAAGGACCTACAGAAGGTTTCTTCAG^ TCCTCGGACA TCCATTCTTA 

?«GGAGCCCTCTCA« 



AAACTAGCAGGTCCACCGTCT1 
SEQ ID NO: 116-SURTK106 H GTOTACCATTA TAGCA 

OTCCACCTAGGGACCTAAGCTGGGAAGCAGGAa^ 

aSgatcggtggaaaac^g™ 

GTGCCGC<M<»GCAACTCTCTCAAGnCTGGAGCAGAT GTGTTRTTC TCAAG 

»tctttcgagccaatatgaacactggggaccc^™ CCRATTC 

GCT^AAAAGAACCAGCTGGGCTCCATGAGGTACA^GATTTCTT 
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GAACATTCAACATGTATTGTTCATTAAGCTAGCTTCCTAGTTCCGATTAGACTAAGGAGA 
CTAAGCCTAGAGAGTCAATGTTAGAACAGTGAAAAGAATTCTGTGTGTGTGTGTGTGTGT 
GTGTGTGTGTGCACAATAAATAGGAAATGTAGAAACCAAGCAAGAAGGCTTAGTAGCTCA 
GTCTTTAACAAGGGCTAGAAAAGAATGTAATCTGATATGGAAGGATAGCAGCTTCTAATT 
TTCAATCATCTGTTGATATACTGTGAAACTTATTTTATTAAATTAATATTTATTAAATGG 

SEQ ID NO: 117_AA098024_M 

CTGCAGGAGAAGCACCTGTTTCATGGGGATGTGGCTGCCAGGAACATCCTGATCCAAAGT 
GACCTGACTCCCAAACTTTGTCATCTGGGCCTGGCTTATGAAGTTCATGCCCATGGGGCC 
ATCTCCTCTGCTCGATCCAGCACCATCCCTCTCAAGTGGCTTGCTCCAGAAAGGCTTCTC 
CTGAGACCTGCAAGCATCAGGGGAGATATTTGGTCCTTTGGGATCCTGCTTTATGAGATG 
GTGACTCTAGGAGCACCACCATACCCTGAAGTCCCTCCCACCAGCATCCTACAATATCTT 
CAGAGAAAGAAAATCATGAAGAGACCCAGCAGCTGCTCACATGCCATGTACAACATCATG 
AAGTGCTGTTGGCGCTGGAGTGAGGACAGCCGCCCCTTACTTGTTCAGCTGCTCCAGCGC 
CTAGAAGCTGCTTCTAGATCTGCCGATGACAAGGCTGTGTTGCAAGTGCCAGAGTTGGTG 
GTGCCTGAACTGTATGCAGATGTGGCTGGCATCAGGGCAGAAAGCATTTCCTATAGCTTC 
AGTGTCCTTTGAAGATGGTCCTAGACAAATGACTATATATGGGTGGAATTAGTTCCTTCA 
AGAACAGAGAGAAGGAACTTTCTGTGGCCCACCAAGGGAGAAAAAAGGACATGGATCTTG 
CATCTTTCCCTAAACATTTTCCTAGACATCTGAAATGCTGCTGGATGAAGCTCTACCTCT 
ACATACCATGTACTCTTGAGCTAAGAATCACCATCAATTGTAGTTTGCTTTCCAGTCCCA 
AGGGCTGAAGTATAAGTGGTGGACCGTGTCATTCTAAAGGAGGTTTTTAAAATCTGCAAT 
GATTGTAAGGGAATTAGGCAAAAGGGCTGGTCCCACTCACTCCAGGCTGGTTTACTACTG 
AAACTAGTTTTTCTTTTCTTTTTTTTTAAGTTAAACTATTACAGAGTAAAAATAAACCAG 
ATGGGCATGAATGAACACCTTCTAATTTTTAACCATGAATTGAATATTGGAATTCATGAG 
AAAGAAAATTCTAGGTTCTTTTTGCTAAGAGGTGTTAAGGTGAGTCAATATATCCTTCAA 
GGAAAGGCTTTGTCTCATCTATGTTGACGGGACGTAAAAGTCCTCGTCCCGTTATGAAGA 
GCTGAAGAAGATCTATAAGAAACAATACTGAGCCTTTCCTTGACTATAGATAGAAGAGCA 
TCCTTTCATTGAACTCTGAGGCAGGTGGACCATGCATGATACTAAGTTTAATTAGAAGCA 
GGAGGAGTCATTTCATGATTAGGAACATTGTTCATCCCATTTGTTTGCCAGTTCCTGTAA 
GACTAAGGAGAATCAGCCTATAGAGCCAAAGCTAGAACCAGGGATAAAAAGTGTGTGTGT 
GTATAACAAATAGGAAGCATGAAAGTCGAGCAAGAAGACTTAGTAACCCAGGTGGTCATT 
AAGAGGTACAGAGAAGAAGTAATCTTATAGGAATGGATGGTAGCTTCTAATTTTTAACCA 
TTCATTGAAATAACTGTGAAGCAACTCATTAAACTAGTATTTATTGACCAAAAGTAGACT 
TTTCAGGTGTATAGCTGCCAAAATCTCTATAATAAAGAGGCTAAAAGAAAATAAATGGGA 
GTTATTTTACTAGGAAAATTAGAGAACCTATAGTTTCCAAAAAGAGATTCTTTATGTGCA 
AAATGAGATAACTCTCTACCTCACAGGGTTGGTGTGAGGAACAATGAGAATATGTATTTG 
TGTATTATGTAGAATATAATATATTCTCAATAAATACTAGTTTTTCCCCTTTC 

SEQ ID NO: 118_SGK2ALPHA_H 

GAAGAGGGCAGAGCCGTGCATGGGGCTGCTCCCCAGGACCTGAGCAGGAACCTGGAGTTT 
TCAGAGCTGCCTGATCATTGCTACAGAATGAACTCTAGCCCAGCTGGGACCCCAAGTCCA 
CAGCCCTCCAGGGCCAATGGGAACATCAACCTGGGGCCTTCAGCCAACCCAAATGCCCAG 
CCCACGGACTTCGACTTCCTCAAAGTCATCGGCAAAGGGAACTACGGGAAGGTCCTACTG 
GCCAAGCGCAAGTCTGATGGGGCGTTCTATGCAGTGAAGGTACTACAGAAAAAGTCCATC 
TTAAAGAAGAAAGAGCAGAGCCACATCATGGCAGAGCGCAGTGTGCTTCTGAAGAACGTG 
CGGCACCCCTTCCTCGTGGGCCTGCGCTACTCCTTCCAGACACCTGAGAAGCTCTACTTC 
GTGCTCGACTATGTCAACGGGGGAGAGCTCTTCTTCCACCTGCAGCGGGAGCGCCGGTTC 
CTGGAGCCCCGGGCCAGGTTCTACGCTGCTGAGGTGGCCAGCGCCATTGGCTACCTGCAC 
TCCCTCAACATCATTTACAGGGATCTGAAACCAGAGAACATTCTCTTGGACTGCCAGGGA 
CACGTGGTGCTGACGGATTTTGGCCTCTGCAAGGAAGGTGTAGAGCCTGAAGACACCACA 
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TCCACATTCTGTGGTACCCCTGAGTACTTGGCACCTGAAGTGCTTCGGAAAGAGCCTTAT 
GATCGAGCAGTGGACTGGTGGTGCTTGGGGGCAGTCCTCTACGAGATGCTCCATGGCCTG 
CCGCCCTTCTACAGCCAAGATGTATCCCAGATGTATGAGAACATTCTGCACCAGCCGCTA 
CAGATCCCCGGAGGCCGGACAGTGGCCGCCTGTGACCTCCTGCAAAGCCTTCTCCACAAG 
GACCAGAGGCAGCGGCTGGGCTCCAAAGCAGACTTTCTTGAGATTAAGAACCATGTATTC 
TTCAGCCCCATAAACTGGGATGACCTGTACCACAAGAGGCTAACTCCACCCTTCAACCCA 
AATGTGACAGGACCTGCTGACTTGAAGCATTTTGACCCAGAGTTCACCCAGGAAGCTGTG 
TCCAAGTCCATTGGCTGTACCCCTGACACTGTGGCCAGCAGCTCTGGGGCCTCAAGTGCA 
TTCCTGGGATTTTCTTATGCGCCAGAGGATGATGACATCTTGGATTGCTAGAAGAGAAGG 
ACCTGTGAAACTACTGAGGCCAGCTGGTATTAGTAAGGAATTACCTTCAGCTGCTAGGAA 
GAGCGACTCAAACTAACAATGGCTTCAACGAGAAGCAGGTTTATTTTTTCCAGCACATAA 
AAGAAAAATAATGTTTCGGAGTCCAGGACTGGCAGGACAGGTCATCAGATACTCAGAGGC 
TGTATCTCTGCCCTGCCAACCTTGACAAATGGCTTCCAATGTTAGGTTTGCTACAAGATG 
GTTACTGGAGCTCTAGCTGCCTATTTTGTGTTTAGGGAAGGGAAAATGGAGGAAAGGGGA 
GAAGAGCAAAGGGCGCTTTT7VAAGAGCTTTCCCAAAAGCTCCCCCCAATGACTTTTGCTT 
CCATCTCACTAACCACCCACCCCTACCTGGAATGGAGGCTGGGAAATGTGGCTTATTTGC 
TGGGTACGTGACTATCCCTAATAACAAAGGGGTTTTGACCCTAAGACATTAGGGGAGAAT 
GTTGGGTAGGCAGCCAGCCCTCTTTTACCATAGGGCCTCCTGGTGTTTGGATTTTGATCT 
CAATGTGTAAAATGACAGAGATGTAACAAGCTCATAGGGTATCAATATCTCTTATTGTTC 
TATGTTGAAAAA 

SEQ ID NO: 12 0_CCRK_H 

ATGGACCAGTACTGCATCCTGGGCCGCATCGGGGAGGGCGCCCACGGCATCGTCTTCAAG 
GCCAAGCACGTGGAGACTGGCGAGATAATTGCCCTCAAGAAGGTGGCCCTAAGGCGGTTG 
GAAGACGGCTTCCCTAACCAGGCCCTGCGGGAGATTAAGGCTCTGCAGGAGATGGAGGAC 
AATCAGTATGTGGTACAACTGAAGGCTGTGTTCCCACACGGT'GGAGGCTTTGTGCTGGCC 
TTTGAGTTCATGCTGTCGGATCTGGCCGAGGTGGTGCGCCATGCCCAGAGGCCACTAGCC 
CAGGCACAGGTCAAGAGCTACCTGCAGATGCTGCTCAAGGGTGTCGCCTTCTGCCATGCC 
AACAACATTGTACATCGGGACCTGAAACCTGCCAACCTGCTCATCAGCGCCTCAGGCCAG 
CTCAAGATAGCGGACTTTGGCCTGGCTCGAGTCTTTTCCCCAGACGGCAGCCGCCTCTAC 
ACACACCAGGTGGCCACCAGGTCTGTGGGCTGCATCATGGGGGAGCTGTTGAATGGGTCC 
CCCCTTTTCCCGGGCAAGAACGATATTGAACAGCTTTGCTATGTGCTTCGCATCTTGGGC 
ACCCCAAACCCTCAAGTCTGGCCGGAGCTCACTGAGCTGCCGGACTACAACAAGATCTCC 
TTTAAGGAGCAGGTGCCCATGCCCCTGGAGGAGGTGCTGCCTGACGTCTCTCCCCAGGCA 
TTGGATCTGCTGGGTCAATTCCTTCTCTACCCTCCTCACCAGCGCATCGCAGCTTCCAAG 
GCTCTCCTCCATCAGTACTTCTTCACAGCTCCCCTGCCTGCCCATCCATCTGAGCTGCCG 
ATTCCTCAGCGTCTAGGGGGACCTGCCCCCAAGGCCCATCCAGGGCCCCCCCACATCCAT 
GACTTCCACGTGGACCGGCCTCTTGAGGGAGTCGCTGTTGAACCCAGAGCTGATTCGGCC 
CTTCATCCTGGAGGGGTGAGAAGTTGGCCCTGGTCCCGTCTGCCTGCTCCTCAGGACCAC 
TCAGTCCACCTGTTCCTCTGCCACCTGCCTGGCTTCACCCTCCAAGGCCTCCCCATGGCC 
ACAGTGGGCCCACACCACACCTTGCCCCTTAGCCCTTGCGAGGGTTGGTCTCGAGGCAGA 
GGTCATGTTCCCAGCCAAGAGTATGAGAACATCCAGTCGAGCAGAGGAGATTCATGGCCT 
GTGCTCGGTGAGCCTTACCTTCTGTGTGCTACTGACGTACCCATCAGGACAGTGAGCTCT 
GCTGCCAGTCAAGGCCTGCATATGCAGAATGACGATGCCTGCCTTGGTGCTGCTTCCCCC 
GAGTGCTGCCTCCTGGTCAAGGAGAAGTGCAGAGAGTAA 

SEQ ID NO: 121_TESK2_H 

GAATTCGCGGCCGCTCGACGCTCAGCAGAGCTACCAGCTGCCCTGTTGGCTTCGCTGGTC 
GGATCGTCCTCCTGGCCCCGCCAAACAGGCGAGCGGCCCCGACTGTGGGGCATGGCAGTA 
GTCTCCTCGTTCTCCGCCGCCGCTAGCCTAGCTGAGTCGCCGGCTTCTGCGCTAGGGGCT 
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CCCACCGCCTCCGCAGGCTAAGGAGCCGCTGCCACCAACGAGCTGTGAGGGTTACTATGC 

TCCCTCTTTGCCGCCGTCTCCTCCTCTTGCCCGCGCAGGCACCCCTCTGGCTGCTCAGTC 

CTGCCTCAGTGTCAAACCAGAAGAGAAGTAAAATTCAACAAAAATTTATGTGTGGAGTTC 

CTTCTTAAAAGAAGAAAAAAGTGATTATTTAGACTATGGATCGGAGCAAACGGAATTCAA 

TTGCAGGATTTCCTCCACGTGTGGAGCGTCTTGAAGAGTTTGAAGGAGGTGGTGGAGGAG 

AAGGAAATGTGAGCCAGGTGGGAAGAGTTTGGCCATCTTCGTATCGAGCTCTTATAAGTG 

CCTTTTCCAGACTGACGCGTTTGGATGATTTCACCTGTGAAAAAATAGGGTCTGGCTTCT 

TTTCTGAAGTGTTCAAGGTACGACACCGAGCTTCTGGTCAGGTGATGGCTCTTAAGATGA 

ACACATTGAGCAGTAACCGGGCAAACATGCTGAAAGAAGTACAGCTCATGAATAGACTCT 

CCCATCCCAACATCCTTAGGTATATCAACTCCGGGAACCTGGAACAGTTGCTAGACAGTA 

ACCTGCATTTGCCTTGGACTGTGAGGGTAAAACTGGCCTATGACATAGCAGTGGGCCTCA 

GCTACCTTCACTTCAAAGGCATTTTTCATCGGGACCTCACATCTAAGAACTGCCTGATAA 

AGAGGGATGAGAATGGTTACTCTGCAGTGGTAGCTGACTTTGGCCTGGCTGAGAAGATCC 

CCGATGTCAGCATGGGGAGTGAGAAGCTGGCCGTGGTGGGTTCCCCATTCTGGATGGCAC 

CTGAGGTTCTCCGAGATGAGCCCTATAATGAAAAGGCAGATGTGTTCTCTTATGGTATCA 

TCCTCTGCGAGATCATCGCCCGCATCCAGGCCGATCCGGACTATCTTCCCCGCACAGAGA 

ATTTCGGGCTGGACTATGATGCTTTCCAGCACATGGTGGGAGACTGTCCCCCAGATTTTC 

TGCAACTTACTTTCAACTGCTGTAACATGGATCCCAAACTGCGCCCATCTTTTGTGGAGA 

TTGGGAAGACCCTGGAGGAAATTCTGAGCCGCCTACAGGAAGAAGAGCAGGAGAGGGATA 

GGAAGCTGCAGCCCACAGCCAGGGGACTCTTGGAGAAAGCACCTGGGGTGAAGCGACTAA 

GCTCACTGGATGACAAGATCCCCCACAAGTCACCATGCCCAAGACGTACCATCTGGCTGT 

CTCGAAGCCAGTCAGAlATCTTTTCCCGTAAGCCCCCACGTACAGTGAGTGTCTTGGACC 

CATACTACCGGCCACGAGATGGTGCTGCCCGCACCCCCAAAGTCAACCCTTTTAGTGCTC 

GCCAGGACCTCATGGGGGGCAAGATCAAGTTTTTTGACCTGCCCAGCAAGTCTGTCATCT 

CTCTGGTATTTGACCTGGATGCACCAGGGCCCGGAACTATGCCCCTGGCTGACTGGCAGG 

AGCCCCTGGCCCCACCTATTCGCCGGTGGCGTTCCTTGCCTGGTTCGCCTGAGTTCTTGC 

ATCAAGAGGCTTGTCCATTTGTGGGCCGGGAAGAATCGCTATCTGATGGGCCCCCACCAC 

GCCTAAGTAGTCTCAAGTACAGAGTTAAAGAGATCCCACCATTCCGGGCATCTGCCCTAC 

CAGCTGCTCAAGCCCATGAGGCTATGGACTGCTCCATTCTCCAGGAAGAAAATGGTTTTG 

GGTCCAGGCCCCAGGGGACCAGTCCATGCCCTGCGGGTGCTTCTGAGGAGATGGAGGTAG 

AAGAAAGGCCAGCAGGCTCAACTCCAGCCACCTTCTCCACCTCAGGCATAGGCCTGCAAA 

CCCAGGGAAAGCAGGATGGGTGAGGGGGTTTAGTCCCTGCCTCACCTTGGGGATGGACCT 

TCAGCTGAAACCATATGGCCCCCTAGGTGCACAGCCTTGATTCTTCCCTGGAGCCTACAG 

AGCAGGCAGGCTAGGCCAAGCCAGGCTCAACTTCTGGGCTCCCAGTGCCCATTGGCTGTG 

TATGACGGGAGGCAGCAGTGAGAGGCCTTCCTAGTTAGGGCCAACAGCTGATACCAAGCC 

TCTGAAATCCAGCAAGGAGGTCTGCCTCCCACCAGACCCTCTCCAGTGTACTTCCCCAGA 

TAGGACCAGAGGATGTCTAGTTCTAGGCTGAGCTGGCAGGCAGCTATTACCCCGGTTCTT 

TCCCCACCCCAGGTCTGTCTCTTGCCTTTTCTTGGGGCATATAAGCTACTGAGTGGAACA 

TGGAGCTGATCAAGAGGCCGTAATGGTCATGGCTGTTTCCAGACCTGAATATTGGGTGCT 

TCTTGCCAGTATTCTAAGACATTTGAGTAATTGCTGTTTGCACTTACTGCATGGTCAGAC 

CACGTCACTACATTTCTATGCAAGGGGACAGCAAGGCAGCGTGGTGGTCATGGCTCTTAG 

CTAACCTATTCAAAGACCTTTTCCTGTTGATTAATCTATTTTCATATTTATAAAGGAGTC 

TTAATGTTCTGCCCCATAAGACTTTCAACCTTGTGGTTGGGAGTGGGGCTGGTTTTGTAG 

GCCCTAGGGCCTGCTTCTATGTATTTATCAACATGTGATACATTCAATTGGTTAAATGGT 

TTATACAGGGACTGATTTGCTTCCCTTCCTGCCATGGCTGGAGCTTTGGGAACAGTCTGT 

CCTTACAGAGCTGCAATAAGAAATAACCAAAGATGAAGCTGGTCAAATATTTTCATAACT 

TGCTTCTGTTGATTTTTTTTTTGTAAAACTTTCCCAAGACATTTTCAGACTTAAAAATAA 

AGTCAGTGTTACAGGT 
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BIOCHIMICA ET BI0PHYSICA ACTA, 






vol. 1450, no. 1, 6 May 1999 (1999-05-06), 
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figure 1 
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abstract 
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serine/threonine-protein kinase COT-1." 

XP002165848 

abstract 

DOC. AGAINST INV. 12 (SEQ.IDs. 133, 15) 

W0 99 57144 A (INCYTE PHARMA INC 

; PATTERSON CHANDRA (US); AZIMZAI YALDA 
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-/- 
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abstract 
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1,2,4-7, 
10-15 


X 


DATABASE EMBL 'Online! 

Accession Number AI215680, 
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abstract 
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SEQ.IDs. 5 and 6 

DOC. AGAINST INV. 15 (SEQ.IDs. 136, 18) 
DOC. AGAINST INV. 16 (SEQ.IDs. 137, 19) 


1-15, 

26-30, 

35-38 


X 


EP 0 861 896 A (DADE BEHRING MARBURG GMBH) 
2 September 1998 (1998-09-02) 

SEQ.IDs. 1 and 2 

DOC. AGAINST INV. 15 (SEQ.IDs. 136, 18) 
DOC. AGAINST INV. 16 (SEQ.IDs. 137, 19) 


1-15, 

26-28, 

35-38 
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glucocorticoid-dependent protein kinase 

(Sgk) mRNA. " 

XP002165851 

abstract 
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DOC. AGAINST INV . 113 (SEQ.IDs. 234, 116) 


1-14,26, 

71 ?Q ^0 


X 


DATABASE EMBL 'Online! 

Accession Number Z98752, 

23 August 1997 (1997-08-23) 

RAMSAY H.: Human unm sequence Trom cione 

RP1-138B7 on chromosome 20ql3.12." 

XP002165852 

abstract 

DOC. AGAINST INV. 17 (SEQ.IDs. 138, 20) 


1-14 


X 


W0 98 31802 A (GENETICS INST) 
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12 
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abstract 
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1,2,4,6, 
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Accession Number AA383293, 
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ADAMS M. D. ET AL. : "Homo sapiens cDNA 
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XP002165854 
abstract 
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1,2,4,6, 
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MARRA M. ET AL.: "Mus musculus cDNA clone 
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XP002165855 

abstract 
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-/~ 
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1,2,4,6, 
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;MURAMATSU MASAAKI (JP); TOKUMITSU HIROSHI 






(JP) 7 October 1999 (1999-10-07) 






page 41 -page 51 






00C. AGAINST INV. 24 (SEQ.IDs. 145, 27) 




X 




SANJO H. ET AL: "DRAKs , novel 


1-14,16, 




serine/threonine kinases related to 


35,37 




death-associated protein kinase that 






trigger apoptosis." 






JOURNAL OF BIOLOGICAL CHEMISTRY, 






vol. 273, no. 44, 






ou ucxoDer lyyo ^ iyyo iu ju;, payeb 






29066-29071, XP0O2165841 






ISSN: 0021-9258 






figure 1 






DOC. AGAINST INV. 26 (SEQ.IDs. 147, 29) 




P,X 




W0 99 33961 A (AKIRA SHIZU0 ;KAWAI TAR0 


1-14,16, 


(JP); ASAHI CHEMICAL IND (JP)) 


26,27, 




8 July 1999 (1999-07-08) 


29,30, 




35 37 




SEQ.IDs. 5 and 6 






DOC. AGAINST INV . 26 (SEQ.IDs. 147, 29) 






SEQ.IDs. 29 and 30 






DOC. AGAINST INV. 40 (SEQ.IDs. 161, 43) 




X 




NAGASE T. ET AL.: "PREDICTION OF THE 


1,2,4-7, 




CODING SEQUENCE OF UNIDENTIFIED HUMAN 


10-14, 




GENES. XIII. THE COMPLETE SEQUENCE OF 100 


16,19 




NEW CDNA CLONES FROM BRAIN WHICH CODE FOR 






LARGE PROTEINS IN VITRO" 






DNA RESEARCH, 






vol. 6, 26 February 1999 (1999-02-26), 






pages 63-70, XP000952912 






ISSN: 1340-2838 






the whole document 






-& DATABASE EMBL 'Online! 






Accession Number AB023153, 






9 April 1999 (1999-04-09) 






NAGASE T. ET AL.: "Homo sapiens mRNA for 






KIAA0936 protein." 






XP002166242 






abstract 






DOC. AGAINST INV. 54 (SEQ.IDs. 175, 57) 






-& DATABASE EMBL 'Online! 






Accession Number AB023216, 






9 April 1999 (1999-04-09) 






MATACC T FT At • "Unmn canicnc mRNA fnr 
NMbAot 1 . tl nL . . nOlTlO bap 1 Glib llirvlitt TUl 






KIAA0999 protein." 






XP002166243 






abstract 






DOC. AGAINST INV. 28 (SEQ.IDs. 149, 31) 






-/-- 





Form PCT/lSA/210 (continuation ot second sheet) (July 1992) 



page 7 of 23 



INTERNATIONAL SEARCH REPORT 



Ini .tionet Application No 

PCT/US 00/14842 



C.<Continuation) DOCUMENTS CONSIDERED TO BE RELEVANT 



Category • 



Citation of documenl. with indication. where appropriate, ot the relevant passages 



Relevant lo claim No. 



WO 98 01756 A (UNIV WASHINGTON) 
15 January 1998 (1998-01-15) 

SEQ.IDs. 1 and 2 

DOC. AGAINST INV. 30 (SEQ.IDs. 151, 33) 

DATABASE EMBL 'Online! 

Accession Number W90839, 

9 July 1996 (1996-07-09) 

MARRA M. ET AL.: "Mus musculus cDNA clone 

IMAGE : 420441 5' similar to gb:M80359 

putative serine/threonine protein kinase." 

XP002165856 

abstract 

DOC. AGAINST INV. 31 (SEQ.IDs. 152, 34) 

NAGASE T. ET AL. : "PREDICTION OF THE 
CODING SEQUENCES OF UNIDENTIFIED HUMAN 
GENES. IV. THE CODING SEQUENCES OF 40 NEW 
GENES (KIAA0121-KIAA0160) DEDUCED BY 
ANALYSIS OF CDNA CLONES FROM HUMAN CELL 
LINE KG-1" 
DNA RESEARCH, 
vol . 2, no. 4, 

31 August 1995 (1995-08-31), pages 

167-174, XP000676653 

ISSN: 1340-2838 

the whole document 

-& DATABASE EMBL 'Online! 

Accession Number D50925, 

1 August 1996 (1996-08-01) 

NAGASE T. ET AL.: "Human mRNA for KIAA0135 

gene. " 

XP002166244 

abstract 

DOC. AGAINST INV. 32 (SEQ.IDs. 153, 35) 
DOC. AGAINST INV. 33 (SEQ.IDs. 154, 36) 

DATABASE EMBL 'Online! 

Accession Number U79240, 

14 December 1996 (1996-12-14) 

YU W. ET AL.: "Human serine/threonine 

kinase mRNA." 

XP002165857 

abstract 

DOC. AGAINST INV. 33 (SEQ.IDs. 154, 36) 

-/-- 



1-14,16, 

26,27, 

35,36 



1,2,4,6, 
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Accession Number AI036899, 

ess uuric i3?o v i uo 4?; 

MARRA M. ET AL.: "Mus musculus cDNA clone 

IMAGE:1746011 5' similar to TR:Q99763 

serine/threonine protein kinase." 

XP002165858 

abstract 

DOC. AGAINST INV. 33 (SEQ.IDs. 154, 36) 




1,2,4,6, 
7,10-13 


X 


DATABASE EMBL 'Online! 

Accession NUniDer Ai**oyuj«3, 

17 March 1999 (1999-03-17) 

STRAUSBERG R.: "Homo sapiens cDNA clone 

IMAGE : 2137322. " 

XP002165859 

abstract 

DOC. AGAINST INV. 35 (SEQ.IDs. 156, 38) 




6,7 


X 


DATABASE EMBL 'Online! 

Accession NumDer aluu/^^d, 

7 April 1999 (1999-04-07) 

BRUCE D. ET AL.: "Homo sapiens chromosome 

16 clone RPCI-11_480G7. " 

XP002166245 

abstract 

DOC. AGAINST INV. 37 (SEQ.IDs. 158, 40) 




6,7 


P,X 


WO 99 49062 A (FAN WUFANG ;GENE LOGIC INC 
(US); PRASHAR YATINDRA (US)) 
30 September 1999 (1999-09-30) 
SEQ.IDs. 1 and 2 

DOC. AGAINST INV. 37 (SEQ.IDs. 158, 40) 
DOC. AGAINST INV. 55 (SEQ.IDs. 176, 58) 




1-14, 

26-30, 

35-38 


X 


DATABASE EMBL 'Online! 

Accession Number AI596766, 

26 April 1999 (1999-04-26) 

MARRA M. ET AL.: "Mus musculus cDNA clone 

IMAGE:949322 5' similar to WP:ZC373.4 

CE02377 myosin-1 ight-chain kinase domain." 

XP002166246 

abstract 

DOC. AGAINST INV. 38 (SEQ.IDs. 159, 41) 
DOC. AGAINST INV. 39 (SEQ.IDs. 160, 42) 

-/~ 




1,2,4,6, 
7,10-13, 
16 
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NAGASE T. ET AL.: "PREDICTION OF THE 
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16,25 
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DNA RESEARCH, 






vol. 7, 28 February 2000 (2000-02-28), 






pages 65-73, XP000923011 






KIAA1297 






-& DATABASE EMBL 'Online! 






Accession Number AB037718, 






14 March 2000 (2000-03-14) 






OHARA 0. ET AL. : "Homo sapiens mRNA for 






KIAA1297 protein." 






XP002166247 






abstract 






DOC. AGAINST INV. 38 (SEQ.IDs. 159, 41) 






DOC . AGAINST INV. 39 (SEQ.IDs. 160, 42) 






-& DATABASE EMBL 'Online! 






Accession Number AB037759, 






14 March 2000 (2000-03-14) 






OHARA 0. ET AL.: "Homo sapiens mRNA for 






KIAA1338 protein." 






XP002167889 






abstract 






DOC. AGAINST INV. 67 (SEQ.IDs. 188, 70) 






-& DATABASE EMBL 'Online! 






Accession Number AB037790, 






14 March 2000 (2000-03-14) 






OHARA 0. ET AL.: "Homo sapiens mRNA for 






KIAA1369 protein." 






XP002167890 






abstract 






DOC. AGAINST INV. 68 (SEQ.IDs. 189, 71) 






-& DATABASE EMBL 'Online! 






Accession Number AB037781 , 






14 March 2000 (2000-03-14) 






II A r> A r *T T » 1 till rt ft 1 A /" ^ 

NAGASE T. ET AL.: Homo sapiens mRNA for 






KIAA1360 protein, partial cds." 






XP002168226 






abstract 






DOC. AGAINST INV. 82 (SEQ.IDs. 203, 85) 




X 


KAWAI T. ET AL.: "Duet is a novel 


1-14,29, 




serine/threonine kinase with Dbl-Homology 


30,35-38 




(DH) and Pleckstrin-Homology (PH) domains" 






GENE, 






vol. 227, no. 2, 






18 February 1999 (1999-02-18), pages 






249-255, XP004158739 






ISSN: 0378-1119 






the whole document 
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abstract 
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clone IMAGE:730582 5.' similar to gb:X66363 
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ArUuYl00.cbu 






abstract 
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Box I Observations where certain claims were found unsearchable (Continuation of Hem 1 of first sheet) 

This International Search Report has not been established in respect oi certain claims under Article 17(2)(a) tor the to! lowing reasons: 
1. [[I Claims Nos.: 

because they relate to subject matter not required to be searched by this Authority, namely: 



2. PH Claims Nos.: 31-34 

bje*JsFUfeJ Wife tJWSifiyWr1ft>'iIrOWna3b8«^p^i£ali^rI fcA/cfcKIt comply with the prescribed requirements to such 
an extent thai no meaningful International Search can be carried out. specifically: 



3. | | Claims Nos.: 

because they are dependent claims and are not dratted in accordance with the second and third sentences of Rule 6.4(a). 

Box II Observations where unity of invention is lacking (Continuation of item 2 of first sheet) 

This International Searching Authority found multiple inventions in this International application, as follows: 

see additional sheet 



1 . I I As all required additional search tees were timely paid by the applicant, this international Search Report covers all 
1 — 1 searchable claims. 

2. | 1 As all searchable claims could be searched without effort justifying an additional tee. this Authority did not invite payment 

of any additional tee. 

X 

3 □ mw^OT^ 

(completely), 23-38 (partially) 



4. | | No required additional search tees were timely paid by the applicant. Consequently, this international Search Report is 
restricted to the invention first mentioned in the claims; it is covered by claims Nos.: 



Remark on Protest | X | The additional search tees were accompanied by the applicant's protest. 

| | No protest accompanied the payment of additional search tees. 
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FURTHER INFORMATION CONTINUED FROM PCT/ISA/ 210 



This International Searching Authority found multiple (groups of) 
inventions in this international application, as follows: 

1. Claims: Invention 1: Claims 1-14,26-38 (all partially) 

A nucleic acid molecule encoding a kinase polypeptide as 
represented by SEQ. ID. 122 or a domain thereof; a vector and 
a recombinant cell comprising said nucleic acid molecule; a 
nucleic acid probe for the detection of said nucleic acid 
molecule; a polypeptide as represented by SEO. ID. 122 or a 
fragment thereof; an antibody or antibody fragment having 
specific binding affinity to said polypeptide; a hybridoma 
which produces said antibody; methods for identifying a 
substance that modulates the kinase activity of said 
polypeptide; methods for treating a disease or disorder by 
administering such a substance; and methods for the 
detection of a kinase polypeptide as represented by 
SEO. ID. 122. 



2. Claims: Inventions 2-78: Claims 1-20, 

23-38 (all partially and as far as applicable) 



A nucleic acid molecule encoding a kinase with a polypeptide 
sequence selected from SEQ. IDs. 123-199, wherein invention 2 
is limited to SEO. ID. 123, invention 3 is limited to SEO. ID. 

124, , and invention 78 is limited to SEQ. ID. 199, or a 

domain thereof; a vector and a recombinant cell comprising 
said nucleic acid molecule; a nucleic acid probe for the 
detection of said nucleic acid molecule; a polypeptide as 
represented by a polypeptide sequence selected from 
SEQ. IDs. 123-199 or a fragment thereof; an antibody or 
antibody fragment having specific binding affinity to said 
polypeptide; a hybridoma which produces said antibody; 
methods for identifying a substance that modulates the 
kinase activity of said polypeptide; methods for treating a 
disease or disorder by administering such a substance; and 
methods for the detection of a kinase polypeptide 
represented by a polypeptide sequence selected from 
SEQ. IDs. 123-199. 



3. Claims: Invention 79: Claim 21 (completely) and Claims 
1-14,26-38 (all partially) 



A nucleic acid molecule encoding a kinase polypeptide as 
represented by SEQ. ID. 200 or a domain thereof; a vector and 
a recombinant cell comprising said nucleic acid molecule; a 
nucleic acid probe for the detection of said nucleic acid 
molecule; a polypeptide as represented by SEQ. ID. 200 or a 
fragment thereof; an antibody or antibody fragment having 
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specific binding affinity to said polypeptide; a hybridoma 
which produces said antibody; methods for identifying a 
substance that modulates the kinase activity of said 
polypeptide; methods for treating a disease or disorder by 
administering such a substance; and methods for the 
detection of a kinase polypeptide as represented by 
SEQ.ID.200. 



4. Claims: Invention 80: Claim 22 (completely) and Claims 
1-14,26-38 (all partially) 



A nucleic acid molecule encoding a kinase polypeptide as 
represented by SEQ.ID.201 or a domain thereof; a vector and 
a recombinant cell comprising said nucleic acid molecule; a 
nucleic acid probe for the detection of said nucleic acid 
molecule; a polypeptide as represented by SEQ.ID.201 or a 
fragment thereof; an antibody or antibody fragment having 
specific binding affinity to said polypeptide; a hybridoma 
which produces said antibody; methods for identifying a 
substance that modulates the kinase activity of said 
polypeptide; methods for treating a disease or disorder by 
administering such a substance; and methods for the 
detection of a kinase polypeptide as represented by 
SEQ.ID.201. 



5. Claims: Inventions 81-121: Claims 1-20, 

23-38 (all partially and as far as applicable) 



A nucleic acid molecule encoding a kinase with a polypeptide 
sequence selected from SEQ. IDs. 202-242, wherein invention 81 
is limited to SEQ.ID. 202, invention 82 is limited to 

SEQ.ID. 203, , and invention 121 is limited to 

SEQ.ID. 242, or a domain thereof; a vector and a recombinant 
cell comprising said nucleic acid molecule; a nucleic acid 
probe for the detection of said nucleic acid molecule; a 
polypeptide as represented by a polypeptide sequence 
selected from SEQ. IDs .202-242 or a fragment thereof; an 
antibody or antibody fragment having specific binding 
affinity to said polypeptide; a hybridoma which produces 
said antibody; methods for identifying a substance that 
modulates the kinase activity of said polypeptide; methods 
for treating a disease or disorder by administering such a 
substance; and methods for the detection of a kinase 
polypeptide represented by a polypeptide sequence selected 
from SEQ. IDs. 202-242. 



6. Claims: Inventions 122-136: Claims 15-20, 

23-25 (all partially) and claims 1-14, 
26-38 (if applicable) 
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A nucleic acid molecule encoding a kinase polypeptide as 
represented by a 'gene name' selected from 'AA980090', 
'AA045601', 'AA297313', 'N23936', '5R72-18-1', '5R79-54-1', 
'5R65-16-1', 'AA065538', 'H17727', 'W08549\ 'AA430250', 
'AA139478', 'R87679', 'W65887', 'AA948538' , '5R69-23-3', and 
'5R69-26-2', wherein invention 122 is limited to 'AA980090', 

invention 123 is limited to 'AA045601', and invention 

136 is limited to '5R69-26-2', or a domain thereof; a vector 
and a recombinant cell comprising said nucleic acid 
molecule; a nucleic acid probe for the detection of said 
nucleic acid molecule; a polypeptide encoded by said nucleic 
acid molecule or a fragment thereof; an antibody or antibody 
fragment having specific binding affinity to said 
polypeptide; a hybridoma which produces said antibody; 
methods for identifying a substance that modulates the 
kinase activity of said polypeptide; methods for treating a 
disease or disorder by administering such a substance; and 
methods for the detection of a said polypeptide. 
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Continuation of Box 1.2 
Claims Nos. : 31-34 



The search was based on the sequence listing furnished in computer 
readable form, the numbering of which differs from the numbering in the 
figures. 

Claims 31-34 refer to a 'substance that modulates the activity of a 
kinase' without giving a true technical characterization. Moreover, no 
such specific compounds are defined in the application. In consequence, 
the scope of said claims is ambiguous and vague, and their subject-matter 
is not sufficiently disclosed and supported (Art. 5 and 6 PCT). 
No search can be carried out for such purely speculative claims whose 
wording is, in fact, a mere recitation of the results to be achieved. 

The applicant's attention is drawn to the fact that claims, or parts of 
claims, relating to inventions in respect of which no international 
search report has been established need not be the subject of an 
international preliminary examination (Rule 66.1(e) PCT). The applicant 
is advised that the EPO policy when acting as an International 
Preliminary Examining Authority is normally not to carry out a 
preliminary examination on matter which has not been searched. This is 
the case irrespective of whether or not the claims are amended following 
receipt of the search report or during any Chapter II procedure. 
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